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Preface 


The Sea-viewing Wide Field-of-view Sensor (SeaWiFS) is the successor ocean color imaging system to the 
NIMBUS-7 Coastal Zone Color Scanner (CZCS). The SeaWiFS calibration and validation effort (McClain et 
al. 1992) includes laboratory, on-orbit, in situ, and quality control elements which require pre- and post-launch 
measurements and analyses. The CZCS experience demonstrated the need for comprehensive documentation 
and access to digital data from the prelaunch sensor calibration and characterization, high quality in situ data 
for bio-optical algorithm development (Clark 1981, Mitchell and Holm-Hansen 1991), and post-launch vicarious 
calibration (Evans and Gordon 1994). The SeaWiFS calibration and validation program is designed to support 
the collection and archiving of these data. 

The laboratory element is concerned with several activities in support of producing verifiable radiances from 
the instrument. These activities include the prelaunch characterization of the sensor [Barnes et al. (1994)] 
and a continuing series of SeaWiFS Intercalibration Round-Robin Experiments (SIRREXs) held once a year 
(Mueller 1993 and Mueller et al. 1994). The prelaunch laboratory effort involves establishing and updating 
calibration procedures for SeaWiFS data, bio-optical and atmospheric algorithm development, and quality 
control of all of the relevant data sets. The SIRREX program involves the transfer of National Institute of 
Standards and Technology (NIST) scales of spectral irradiance and radiance, through the SeaWiFS Project, to 
all participating laboratories in the science community and to the calibration standards used to calibrate the 
SeaWiFS instrument for radiance responsivity. Data from the SIRREX and prelaunch SeaWiFS calibration 
provided by the Hughes/Santa Barbara Research Center (SBRC) and Orbital Sciences Corporation (OSC) are 
included in the calibration and validation database. 

The on-orbit sensor calibration is composed of several types of measurements including solar, lunar, and internal 
electronic data collection (Woodward et al. 1993). In addition, the sensor engineering data will be routinely 
monitored. These data and related analysis products will be archived within the calibration and validation 
database in order to track long term system performance. 

The in situ element incorporates measurements of water-leaving radiances and other related quantities, from 
ships and moorings, to develop a vicarious calibration data set based on geographically diverse samples of oceanic 
and atmospheric data sets. The bio-optical database includes historical pigment and optical data, e.g.. CZCS 
NIMBUS Experiment Team (NET) data, which will be expanded over time from 1) deployments of the project- 
sponsored Marine Optical Characterization Experiment (MOCE) Team, 2) an optical time series collected from 
a marine optical buoy (MOBY), and 3) other field expeditions producing data in keeping with an evolving set 
of optical protocols (Mueller and Austin 1992). 

For these measurements to be of use to the pre- and post-launch activities associated with calibrating and 
validating SeaW iFS data, a well documented archive which is easily accessed by all potential users is needed. 
This requirement has produced the SeaW r iFS Bio-Optical Archive and Storage System (SeaBASS); this volume 
of the SeaW iFS Technical Report series is the first step in documenting the architecture of this database. 
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Abstract 


This document provides an overview of the Sea-viewing Wide Field-of-view Sensor (SeaWiFS) Bio-Optical 
o rC u %cQ d St ° ra ? r e Systel ^ (SeaBASS), which will serve as a repository for numerous data sets of interest to the 
SeaU S Science Team and other approved investigators in the oceanographic community. The data collected 
will be those data sets suitable for the development and evaluation of bio-optical algorithms which include 
results from SeaWiFS Intercalibration Round-Robin Experiments (SIRREXs), prelaunch characterization of 
the SeaW iFS instrument by its manufacturer— Hughes/Santa Barbara Research Center (SBRC), Marine Optical 

Ch JT,“ n Exp f nment ( MOCE ) crises, Marine Optical Buoy (MOBY) deployments and refurbishments, 
and field studies of other scientists outside of NASA. The primary goal of the data system is to provide a simple 
mechanism for querying the available archive and requesting specific items, while assuring that the data is made 
available only to authorized users. The design, construction, and maintenance of SeaBASS is the responsibility of 
the SeaWiFS Calibration and Validation Team (CVT). This report is concerned with documenting the execution 
of this task by the CVT and consists of a series of chapters detailing the various data sets involved The topics 
presented are as follows. 1) overview of the SeaBASS file architecture, 2) the bio-optical data system, 3) the 
historical pigment database, 4) the SIRREX database, and 5) the SBRC database 


PROLOGUE 

The Sea-viewing Wide Field-of-view Sensor (SeaWiFS) 
ocean color radiometer is scheduled to be launched in late 
1994 with an expected lifetime of five years. The Na- 
tional Aeronautics and Space Administration’s (NASA) 
Goddard Space Flight Center (GSFC) is tasked with car- 
rying out a program to acquire the global SeaWiFS data 
set, validate and monitor its accuracy and quality, pro- 
cess the radiometric data into geophysical units using a set 
of atmosphereic and bio-optical algorithms, and distribute 
the final products to the scientific community through the 
GSFC Distributed Active Archive Center (DA AC). The 
SeaWiFS data products figure as prominent components 
of major scientific programs studying global change, in- 
cluding the Joint Global Ocean Flux Study (JGOFS), the 
World Ocean Circulation Experiment (WOCE), and the 
Global Ocean Ecosystems dynamics (GLOBEC) programs. 

Two important goals of the SeaWiFS Project are to de- 
termine, from the satellite radiance measurements, a) nor- 
malized water-leaving radiance with an accuracy of 5%. 
and b) chlorophyll a concentration with an accuracy of 
30% (Hooker et al. 1992). The amount of historical data 
suitable for developing and validating the algorithms em- 
ployed is very limited because of the specific suite of si- 
multaneous observations and the radiometric accuracies 
required. (Because the desired result is to validate space- 
borne radiances to within 5%, the comparative m situ ra- 
diometric measurements must be calibrated to be accurate 
within less than 5%.) 

For the Coastal Zone Color Scanner (CZCS), which col- 
lected data from 1978—86, few algorithms have been pub- 
lished and these rarely show broad agreement at the hoped 
for level. The algorithmic differences are primarily the re- 
sult of two factors: 1) the data used are collected from 


regimes having different bio-optical properties and 2) the 
instrumentation and sampling techniques employed are dif- 
ferent. 

In order to minimize observational errors, the SeaWiFS 
Project sponsored a multidisciplinary workshop to outline 
the observations and sampling protocols required for bio- 
optical algorithm development (Mueller and Austin 1992). 
One of the consequences of the workshop was the establish- 
ment of a series of SeaWiFS Intercalibration Round-Robin 
Experiment. (SIRREX) activities to establish and advance 
the state of the art for making validation measurements 
to verify the radiometric precision and long-term stability 
of the SeaWiFS instrument’s radiance responsivities, and 
to validate the atmospheric correction models and algo- 
rithms used to convert SeaWiFS radiances to normalized 
water-leaving radiances. One of the principal approaches 
to this critical aspect of validation will be frequent direct 
comparisons between SeaWiFS estimates and in situ mea- 
surements collected during Marine Optical Characteriza- 
tion Experiment (MOCE) deployments and from the Ma- 
rine Optical Buoy (MOBY). 

I he' only economically feasible approach for minimizing 
spatial biases is to maximize the acquisition of a globally 
distributed sampling of in situ measurements by soliciting 
contributions of data from the oceanographic community 
at large, and to combine them with data collected from 
SeaWiFS sponsored field programs (MOCE and MOBY 
deployments) and instrument calibrations (SIRREX activ- 
ities) in an easily’ accessed database. The database can 
then be used to formulate and compare advanced CZCS- 
type algorithms with the standard algorithms and to de- 
velop preliminary SeaWiFS chlorophyll-like pigment algo- 
r it Inns. 

I he acquisition of the historical and contemporary data 
sets requires the implementation of quality control (QC). 
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documentation, and cataloging procedures and the design 
of a database structure suitable for bio-optical data with 
a user friendly interface. This task is the responsibility 
of the SeaWiFS Calibration and Validation Team (CVT). 
This report is concerned with documenting the execution 
of this task by the CVT and consists of a series of chapters 
detailing the various data sets included in the SeaWiFS 
Bio-Optical Archive and Storage System (SeaBASS). A 
short synopsis of each chapter is given below. 

1. Overview of the SeaBASS File Architecture 

The SeaBASS file architecture is concerned with main- 
taining easy data access while accomodating a variety of 
operating systems. A text format is used with a simple 
file structure composed of two parts: a header block and 
a data block. The former is composed of a unique vo- 
cabulary expressed as a set of keywords and values; the 
latter is composed of a set of observables expressed as a 
tab-delimited matrix — the so-called spreadsheet format. A 
condensed file naming convention is used for systems uti- 
lizing a maximum of 11 characters; an expanded file nam- 
ing convention, which is more readily deciphered, is also 
available when requesting data. To ensure a complete de- 
scription of the data, additional format conventions are 
recommended for all data collection efforts. 

2. The Bio-Optical Data System 

The CVT has developed a software system which allows 
approved investigators to query and order holdings from 
SeaBASS. As of this writing, the archive includes data pro- 
vided by SIRREX exercises, NIMBUS Experiment Team 
(NET) deployments, the Bermuda Atlantic Time Series 
(BATS), the Bermuda Bio-Optical Profiler (BBOP), the 
Center for Hydro-Optics and Remote Sensing (CHORS), 
WOCE/JGOFS activities, Hughes/Santa Barbara Re- 
search Center (SBRC) prelaunch calibration data, and over 
50 historical pigment data sets. Data from MOCE cruises 
and MOBY will be added to the archive after SeaW iFS is 
launched. The CVT has made each of the data files in the 
archive self-describing by adding a metadata header which 
includes information such as the file s spatial and tempo 
ral limits, parameters, and investigators. The metadata is 
stored in a Sybase relational database, which also tracks 
information relating to system users and the status of data 


orders. Authorized users will be able to query the contents 
of the archive through a graphical user interface (GUI) 
built with the Interactive Data Language (IDL) software 
and interfaced to the Sybase tables. This chapter describes 
the system file naming conventions and header keywords, 
database schema, data holdings, and user interface. 

3. The Historical Pigment Database 

The SeaWiFS calibration and validation historical pig- 
ment database was assembled for evaluating biooptical al- 
gorithms, and the quality control of derived products from 
SeaWiFS and other satellite ocean color sensors. As of this 
writing, there are more than 330,000 independent pigment 
observations in the database, representing over 50 data 
sets amassed between 1956 and 1994 from a wide variety 
of investigators and oceanographic expeditions. This chap- 
ter describes the structure of the database, the current set 
of holdings, the procedures for submitting pigment data 
to the SeaWiFS Project, and the methods for converting 
data contributions from the data format in which they are 
submitted to the database format. 

4 . The SIRREX Database 

This chapter describes the procedures used in obtaining 
and archiving the SIRREX data sets. These data sets pro- 
vide a calibration history for the various lamps, spheres, 
plaques, and radiometers used as a radiometric standard 
for field measurements. Currently, there are two SIRREX 
data sets in the archive. This chapter contains the file 
naming and header protocol used for SIRREX data, the 
procedure for archiving the original data, and an example 
of an archived file. 

5. The SBRC Database 

This chapter describes the naming convention and pro- 
cedures used in archiving the SeaWiFS prelaunch calibra- 
tion data obtained from SBRC. The archive contains data 
from bright target recovery (BTR). absolute spectral re- 
sponse (ASR). line spread function (LSF), and tempera- 
ture dependence measurements as well as images of the 
sun, moon, sky, and an integrating sphere. This data set 
contains characterization information on the SeaW iFS in- 
strument provided by the manufacturer. As such, it serves 
as a baseline to the operation of the instrument on orbit. 


2 


Hooker, McClain, Firestone, Westphal, Yeh, and Ge 


Chapter 1 


Overview of the SeaBASS File Architecture 

Stanford B. Hooker 

NASA Goddard Space Flight Center , Greenbelt , Maryland 
James K. Firestone 

General Sciences Corporation, Laurel, Maryland 
Abstract 

The SeaBASS file architecture is concerned with maintaining easy data access while accomodating a variety of 
operating systems. A text format is used with a simple file structure composed of two parts: a header block and 
a data block. The former is composed of a unique vocabulary expressed as a set of keywords and values; the 
latter is composed of a set of observables expressed as a tab-delimited matrix— the scecalled spreadsheet format. 
A condensed file naming convention is used for systems utilizing a maximum of 11 characters; an expanded file 
naming convention, using up to 31 characters which is more readily deciphered, is also available when requesting 
data. To ensure a complete description of the data, additional format conventions are recommended for all data 
collection efforts. 


1.1 INTRODUCTION 

To maintain flexibility and cross-platform access, the 
SeaBASS file architecture utilizes a condensed and expand- 
ed file naming convention. The former makes use of the so- 
called eight- point- three 11-character format designed for 
personal computers (PCs) running the Microsoft disk op- 
erating system (MS-DOS), which has the most restrictive 
naming convention commonly in use; the latter is for any 
computer supporting 31-character file names. 

The condensed file naming convention considers the 
first eight characters as a prefix and the last three as a 
suffix: 

PPPPPPPP.SSS 

(where PPPPPPPP is the prefix and SSS is the suffix). The 
prefix and suffix are used to describe who collected the 
data, the type of data collected, and the circumstances 
surrounding data collection, i.e., when, where, etc. 

All files in the database are stored in the condensed 
format, but users can request data in the expanded format, 
which is more readily deciphered. The expanded file name 
utilizes multiple prefixes to enhance readability and is a 
verbose version of the condensed file name. 

Although upper- and lower-case letters are used in pre- 
senting the file conventions associated with the database, 
the actual codes and identifiers are not case sensitive, thus, 
a in a file name or header is the same as A. Most com- 
puter systems interpret non-alphanumeric characters as 
special characters, so allowed codes and identifiers used 


to build up a file name are restricted to numbers and let- 
ters, that is, 0, 1, . . . , 9, A, B, . . . , Z (hereafter referred to as 
0, . . . , Z), as well as the underscore (_) and the period ( . ). 
When used for most sequential codes, e.g., the day of the 
month, the 0 entry is not used, so the allowed entries are 
1, 2, . . . , 9, A, B, . . . , Z (hereafter referred to as 1, . . . , Z). In 
the expanded file naming convention, the period is used 
more frequently to enhance readability. 

The basic file structure (Fig. 1) is composed of two 
parts: a header block and a data block. The former is 
composed of an agreed upon vocabulary expressed as a 
set of keywords with each keyword and its argument oc- 
cupying one line in the header; the latter is composed of 
a set of observables expressed as a matrix with each row 
in the matrix individually numbered and terminated by a 
carriage return, and each data entry in the row separated 
by a tab — the so-called spreadsheet format. The header 
block begins with the /begin_header keyword and ends 
with the /end_header® keyword. 


/beg in-header (cr) 


/ end_header<D(cr) 


1 ( tab) Value (1,1 ) (tab) Value (1,2). 
2( tab) Value (2 , 1 ) ( tab) Value (2,2). 
3(^6) Value (3 , 1) (£a&) Value (3 , 2) . 

. Value (1,N) (cr) 
Value (2 ,N) (cr) 
Value (3 ,N) (cr) 

M(tab ) Value (M, 1) (tab ) Value (M, 2) . 

. Value (M,N) (cr) 


Fig. 1 . Basic database file structure. 
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1.2 ARCHIVAL PROCEDURES 

The database protocol specifies a header format which 
is inserted at the beginning of every archived file. Any rele- 
vant information in an existing header is incorporated into 
the keyword vocabulary; additional notations are incorpo- 
rated as comment lines, which begin with an exclamation 
point (!). The requirement for tab-delimited data and a 
sequence column often requires modification of the original 
data. While the data is being modified, a consistent nu- 
meric format is used for each data column, which is based 
on the range and precision of the data. Using a consistent 
numeric format allows a quick visual inspection of the data 
to locate errors. 

Investigators submitting data to the database are en- 
couraged to provide data in a form as close to the es- 
tablished protocol as is feasible. The use of commercial 
off-the-shelf (COTS) packages,! like Lotus 1-2-3 or Excel, 
is permitted as long as the basic tab-delimited American 
Standard Code for Information Interchange (ASCII) for- 
mat is adhered to. 

1.3 KEYWORDS 

The keywords are divided into two groups: those re- 
quired for every file and those approved for completing the 
description of the data in the file. One of the advantages 
of the keyword approach is once the vocabulary of the key- 
words is adopted and made unique, automated processing 
is enhanced because the files can be easily parsed. Locally 
defined keywords can be created, perhaps to track post- 
processing of the data. 

A keyword line is made up of a keyword and, where 
required, its value: 

/keyword =value(cr) 

where keyword is an approved keyword that must begin 
with a slash (/) and end with an equals (=) sign except 
for the keywords that denote the beginning and ending of 
the header block (see below); value is a string or numeric 
sequence giving the value to the keyword; and ( cr ) is a ter- 
minating carriage return that all line (or data row) entries 
must have. The value entries are represented by one of the 
following: 

(n), (s), (n)[u], or (s)[u], 

where (n) is a numeric entry composed of characters used 
to represent numbers, (s) is a character string, and [it] is 
a string that gives the units of the keyword value. 

In general, the International System of Units (SI), or 
Systeme International d’ Unites , are used except where 


f Identification of commercial software or equipment to specify 
adequately the experimental problem, does not imply recom- 
mendation or endorsement by the SeaWiFS Project, GSFC, 
or NASA, nor does it imply that the equipment identified is 
necessarily the best available for the purpose. 


traditional usage dictates otherwise. For indicating nu- 
meric ranges, the right bracket (>) is used in place of the 
hyphen (-), which is reserved for negation, and the semi- 
colon is used for the numeric increment. Thus, to indicate 
a measurement from 300-900 nm in steps of 10 nm, the en- 
try would be 300>900 ; 10 [nm] . If the increment is one, the 
entire increment field can be left out. When constructing 
lists, commas are used to separate entries, and ranges are 
used where needed. To indicate all channels out of 6 ex- 
cept channel 2, for example, 1 ,3>6 is used. When denoting 
numbers raised to powers, the caret (“) is used with the 
exponent enclosed in braces, thus, a 10 -2 entry is writ- 
ten as 10“ {-2}. In writing units, the negative exponent 
is preferred; the solidus may be used to emphasize certain 
common unit combinations. 

1.4 FORMAT CONVENTIONS 

The following format conventions are recommended for 
all data collection efforts: 

1. All files are stored and retrieved in the ASCII text 
format. 

2. All files are self-describing and are composed of a 
header block followed by the data. 

3. The data is in a tab-delimited spreadsheet format, 
that is, rows and columns with all column entries 
separated by a tab. 

4. The header block is constructed from a unique vo- 
cabulary set and is made up of a series of one-line 
entries, each composed of a single keyword entry 
with its value, which is terminated with a carriage 
return. A listing of the allowed keyword entries is 
given in the keyword glossary below. 

5. The first header block line contains the keyword 
/begin_header and the last header block line con- 
tains the keyword /end_header<5. All of the inter- 
vening entries are keyword lines or comment lines 
and do not contain the backslash (\) or at-sign (<D) 
characters. 

6. The header block can contain comment lines. All 
comment lines start with an exclamation symbol ( ! ) 
and, like keyword entries, do not contain the back- 
slash (\) or at-sign (0), although they may contain 
white space (blanks). 

7. All header information that is not part of a com- 
ment line is underscore filled, that is, underscores 
are used instead of blanks. This allows for easy 
parsing, since many text translation routines (in- 
side and outside of COTS programs) replace tabs 
with blanks. 

8. All entries are time stamped using the format used 
in the date and time keywords. 

9. All entries start with a sequential sample number, 
so missing data or file corruption can be readily 
discerned. 
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10. Any other information not given in the keyword en- eluded in the header as comment lines (see item 6 above) 

tries that helps interpret the data, either by itself or as separate documentation files, 

or in conjunction with other likely data sets, is in- 
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Chapter 2 


The Bio-Optical Data System 

James K. Firestone 
Eueng-nan Yeh 
Todd L. Westphal 

General Sciences Coloration, Laurel , Maryland 

Charles R. McClain 
Stanford B. Hooker 

NASA Goddard Space Flight Center , Greenbelt, Maryland 
Yuntao Ge 

University Space Research Association, Columbia 7 Maryland 


Abstract 

The CVT has developed a software system which allows approved investigators to query and order holdings 
from SeaBASS. As of this writing, the archive includes data provided by SIRREX exercises, NET deployments, 
BATS collection efforts, BBOP, CHORS, WOCE/JGOFS activities, SBRC prelaunch calibration data, and over 
50 historical pigment data sets. Data from MOCE cruises and MOBY will be added to the archive after SeaWiFS 
is launched. The CVT has made each of the data files in the archive self-describing by adding a metadata header 
which includes information such as the file’s spatial and temporal limits, parameters, and investigators. The 
metadata is stored in a Sybase relational database, which also tracks information relating to system users and 
the status of data orders. Authorized users will be able to query the contents of the archive through a GUI 
built with IDL software and interfaced to the Sybase tables. This document describes the system file naming 
conventions and header keywords, database schema, data holdings, and user interface. 


2,1 INTRODUCTION 

Figure 2 details the overall flow of data through the 
data system. The boxes along the top indicate each of the 
major classifications of data to be stored, including data 
from instrument calibration activities (SIRREX exercises 
and SBRC prelaunch measurements) and the in situ sam- 
pling program (MOCE and MOBY deployments). 

After reception by the Project, each data set that is 
not SIRREX data is assigned a priority based on its (per- 
ceived) demand by the user community. Initially, a two- 
class priority scheme is envisioned. Data sets within the 
high-priority category, category A, will be subdivided, so 
that one file is created for each parameter or granule con- 
tained within the submitted original. This approach has 
several advantages: 

1. It enables the data users to perform queries at 
the parameter level. 

2. It ensures a unique file name for each granule of 
data. 

3. It potentially reduces the size of the files the user 
will need to manipulate. 


4. It simplifies the process of archiving data up- 
dates from the individual investigators. 

Each category A data set will be visually checked to de- 
termine the optimum method for subdividing its contents. 
A program will then be written which performs the sub- 
division, fills missing data with a uniform code of -9999, 
converts time and space coordinates to use standard rep- 
resentations, and tab delimits the data columns. Data sets 
within the low-priority category (category B) will not be 
subdivided initially, and will retain their original format. 
At a later date, a category B data set may be converted 
to category A and subdivided as demand for it increases. 

All output files will be named according to the naming 
conventions applicable to the data, as described in Sec- 
tion 2.4. The name includes the specification of a param- 
eter, which will refer to a specific field for a category A 
data set or to a broader classification, e.g., experiment, for 
a category B data set. The current set of bio-optical pa- 
rameters identified for the files that are not comprised of 
SIRREX data is given in Section 2.2.3. 

After the files are named, a set of keywords are identi- 
fied which completely describe the data set. The keywords 
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SIRREX 


SBRC 


MOCE 


MOBY 


Other Data 


(Cat. B) 


Is splitting the files (one per parameter) desirable? 


Determine the method for splitting the files. 


Read in files, restructure the data, fill in missing 
data with -9999, and tab-delimit the columns. 


Determine the output file names and headers 
from the input files. 


Write the output files (headers and data) while 
replacing any reserved characters. 


Load the headers into Sybase data tables. 


Load comments and headers into master 
catalog for each data set. 


Load the required keywords into an ASCII 
look-up table. 


Copy the data to the magneto-optical (MO) jukebox 
(separate integral volumes for each data type). 


User Query 


Data on the MO jukebox. 


Verify user query and copy data to disk 
or tape for user. 


Fig. 2. The overall flow of data through the data system. 
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consist of both a required set, containing spatial and tem- 
poral information, column headings, and investigator(s), 
plus a more open-ended supplementary set approved for 
the data set. Section 2.5 lists possible values for both 
the required and supplementary sets. The header infor- 
mation is inserted at the beginning of each file. Following 
the header are any comments from the provider’s origi- 
nal file(s), which may include the original headers. All 
comment lines are preceded in the output files by an ex- 
clamation character ( !). The final step in the file creation 
is to remove all special characters, such as backslashes and 
boolean operators, which could be misinterpreted by the 
end user’s computer operating system. 

In order to facilitate the query of the metadata con- 
tained in the headers, their contents are loaded in three 
different places: 1) a set of Sybase relational database ta- 
bles, 2) an ASCII look-up table containing just the re- 
quired keywords in a columnar one line entry per file, and 
3) a master catalog containing the headers and any inter- 
nal comments for all files belonging to each unique data 
type. The current database table structure, which is de- 
signed to be of third normal relational form, is shown in 
Table 1. Note the set of required keywords for each major 
class of data (SIRREX, SBRC, or other bio-optical) are 
stored in separate tables, which serve as look-up tables in 
the database. Within these tables are foreign keys pointing 
to primary keys in other tables which store information on 
experiments, investigators, file names, etc. 

There is also a set of administrative tables which track 
the users of the system, their requests for data, their ac- 
cess to the system, their comments on the system, and the 
Project’s verification of their individual data orders (the 
fourth column in Table 1). 

2.2 CONDENSED CONVENTION 

The condensed file name follows the previously men- 
tioned eight-point- three format and is of the form: 

DDDPPPII . TNN 

(composed of a prefix. DDDPPPII. and a suffix, TNN). The 
prefix is composed of three parts: DDD is the date code, 
PPP is the position code, and II is the data identifier code. 
The suffix is composed of two parts: T is the type code and 
NN is the sequence code. 

2.2.1 Condensed Date Code 

The condensed date code for the data, DDD, indicates 
the number of days since 1 January 1950. Each column in 
the DDD entry consists of an alphanumeric entry incremen- 
ting from 0. .... 9. then A. .... Z. A code of 000 is reserved 
for the case where the date is unknown. A code of 001 
represents 1 January 1950, 009 represents 9 January 1950, 
00A represents 10 January 1950, 00Z represents 4 February 
1950, 010 represents 5 February 1950, etc. 


2.2.2 Condensed Position Code 

The condensed geolocation position for the data, PPP, is 
based on a series of global 2.5° latitude by 2.5° longitude 
bins. Each column in the PPP entry consists of a letter 
in the range A, . . . , Z. The first bin, AAA, has its northwest 
corner at (-180,90); the second bin, AAB, has its northwest 
corner at (-177,5,90); the 145th bin, AF0, has its northwest 
corner at (-180,87.5). An entry of ZZZ is reserved for the 
case where the location is unknown. 

2.2.3 Condensed Identifier Code 

Each character in the condensed data identifier code, 
II, consists of either alphabetic (A, . . . , Z) or numeric val- 
ues (0, ... ,9). For category A data sets, each parameter 
or set of related parameters is saved as a separate file, and 
II is expressed as a value in the range AA, . . . , ZZ. Three 
of the characters, A, S, and Z, if present as the left-most 
character in the II code, are reserved for representing air- 
borne, surface, and subsurface parameters, respectively. A 
value of AF, for example, represents airborne fluorescence 
data. 

For category B data sets, II is the encoded name of 
the experiment or individual responsible for the collection 
of the data. For example, ON represents NET optical data. 
Table 2 contains a list of the experiments of the current 
data holdings, including the experiments referenced in the 
number II entries. 

The condensed identifier codes with the corresponding 
verbose descriptions are as follows: 

1M MOCE-I cruise data 
2M MOCE-II cruise data 
0W Marra WOCE/JGOFS data 
0B Siegel BBOP data 
IB Michaels BATS pigment data 
0C CHORS optical data 
0J JGOFS EqPac data 
ON NET optical data 
IN NET pigment data 

AF Airborne fluorescence 

AH Air relative humidity 

AI Airborne spectral downwelled irradiance 

AP Atmospheric pressure 

AR Airborne spectral upwelled radiance 

AT Air temperature 

BA Spectral beam attenuation coefficient 
BC Beam attenuation coefficient 
BS Spectral backscattering coefficient 

CA Satellite pigment (chlorophyll a) concentration 
(computed) 

CC Coccolith concentration 

CD Diffuse attenuation coefficient, downwelled ir- 
radiance 

CF Chlorophyll a fluorescence profiles 
CP Chlorophyll a profiles 
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Table 1 . Bio-optical database schema for storing: a) metadata common to all bio-optical data sets, and specific metadata for 
non-SIRREX data sets (left column); b) metadata specific to SIRREX data files (left middle column); c) metadata specific to 
SBRC data files (right middle column); and d) administrative information on system users (right column). 


BI00PT_L00KUP 

SIRREX-LOOKUP 

SBRC.L00KUP 

REQUESTS 

Start date 

Column Headings 

Column Headings 

IFI Request ID 

Start time 

[FI Configuration ID 

[FI Archive file ID 

Date and time of request 

End date 

[FI Data type ID 

fFI Data type ID 

|F1 Local account ID 

End time 

Date 

Date 

[FI Archive file ID 

Start latitude 

IF1 Experiment ID 

[FI Instrument ID 

Date verification sent 

Start longitude 

[FI Testing group ID 

[FI Investigator ID 

Date response received 

End latitude 

[FI Lamp/sphere/plaque ID 

Number of data rows 

Date and time filed and 

End longitude 

[FI Instrument ID 

IFI Original file ID 

copied from MO to 

[FI Investigator ID 

IF1 Investigators ID 

IFI Run type ID 

disk or tape 

IFI Affiliation ID 

Number of data rows 

Sequence number 

Date and time file sent to 

rFl Experiment ID 

IF1 Original file ID 

Band 

or copied by user 

[FI Data type ID 

IF1 Run type ID 

[FI Detail record ID 

ACCOUNTS 

Column headings 

IF1 Archive File ID 

SBRC 

IFI Local account ID 

Sequence number 

Sequence number 

[FI Detail record ID 

Account Name 

IFI Data file ID 

[FI Detail record ID 

Collection aperture 

Long account name 

IFI Archive file ID 

SIRREX 

Collection aperture 

Mailing address 

IFI Original file ID 

[FI Detail record ID (72 

units 

Telephone number 

[F] Detail record ID 

additional fields 

IFOVf 

E-mail address 

INVESTIGATORS 

and units where 

IFOV units 

IFI Preferred media type ID 

fPl Investigator ID 

applicable) 

Counts per amp 

[F| Preferred verification 

IF] Affiliation ID 

CONFIGURATIONS 

Counts per amp 

type ID 

Name 

|F1 Configuration ID 

units 

LOGINS 

Mailing address 

Name 

Factor 

IFI Login ID 

Telephone number 

GROUPS 

Filter 

[FI Local account ID 

E-mail address 

[FI Testing group ID 

Filter units 

Login date and time 

DATA_TYPES 

Name 

Information file 

Logout date and time 

IFI Data type ID 

LAMPS 

Lamp 

IF] Request ID 

Name 

[FI Lamp/sphere/plaque ID 

Modification status 

[F| Comment ID 

BIOOPT 

Name 

Slit 

MEDIA 

IF] Detail record ID 

INSTRUMENTS 

Slit units 

IFI Preferred media type ID 

Start time units 

|F1 Instrument ID 

Time 

Name 

End time units 

Name 

Time units 

COMMENTS 

Start latitude units 

RUN TYPES 

Mirror side 

|F| Comment. ID 

End latitude units 

[FI Run types ID 

Comment 

[FI Local account ID 

Start longitude units 

Name 

Wavelength range 

Text of comment 

End longitude units 


Wavelength range 

Date and time of comment 

Documentation file 


units 

Resolution, if a problem 

AFFILIATIONS 



Date of resolution 

FPl Affiliation ID 



Person resolving problem 

Name 



VERIFICATION 

EXPERIMENTS 



IFI Preferred verification 

fPl Experiment ID 



type ID 

Name 



Name 

DATA.FILES 




[FI Data file ID 




Name 




ORIGINAL-FILES 




[FI Original file ID 




Name 




ARCHIVE-FILES 




[FI Archive file ID 




Name 




Pi = Primary key. 

IFI = Foreign key. 

f Instantaneous Fie ld-of- View 
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Table 2. Currently held data sets (as of June 1994). 


Date(s) 

Name 

Provider(s) 

Location 

Description 

1992 

MOCE-I§ 

Clark, Hooker, and Trees 

Monterey Bay 

Optics and Pigments 

1993 

MOCE-II§ 

Clark, Hooker, and Trees 

Gulf of California 

Optics and Pigments 

1975-79 

NET Stations 

Clark 

Coastal US 

Optics and Pigments 

1978-81 

NET Along-Track 

Clark 

Coastal US 

Optics and Pigments 

1991 

WOCE/JGOFS 

Marra 

Tahiti— ► Hawaii 

Optics and Pigments 

1992 

JGOFS EqPac 

Numerous 

Equatorial Pacific 

Optics and Pigments 

1992-3 

BBOP 

Siegel 

Bermuda 

Optics 

1991 

CHORS/BOFSf 

Trees 

North Atlantic 

Optics 

1993-3 

BATS 

Michaels 

Bermuda 

Pigments 

1992 

SIRREX-1 

Mueller, Hooker, and Johnson 

CHORS 

Instrument Calibration 

1993 

SIRREX-2 

Mueller, Hooker, and Johnson 

CHORS 

Instrument Calibration 

1994 

SIRREX-3 

Mueller, Hooker, and Johnson 

CHORS 

Instrument Calibration 

1994 

SBRC 

Roberts and Holmes 

SBRC 

SeaWiFS Calibration 

1956-94 

Historical Pigments 

Numerous 

Global 

>330,000 Observations 


§Data sets will be made available after SeaWiFS launch. 
fBritish Ocean Flux Study 


CS Spectral absorption coefficient 
CU Diffuse attenuation coefficient, upwelled radi- 
ance 

DC Detritus absorption coefficient 
DO Colored dissolved organic matter 
DS Secchi depth 
FA Fulvic acid concentration 
HA Humic acid concentration 
HS Salinity profiles 

IM Inorganic suspended material concentration 

OC Total dissolved organic carbon 
OM Total dissolved organic matter 
ON Particulate organic nitrogen 
OS Organic suspended material concentration 

PA Particle absorption coefficient 
PB Phycobilipigment concentration 
PC Phytoplankton species counts 
PF Particle fluorescence 
PH Phaeophytin concentration 
PO Pigment concentration, other 
PP Primary productivity 

PR Photosynthetically available radiation (PAR) 

PS Particle size spectra 

RD Spectral sky radiance distribution 
RF Rate of flow for aiong-track fluorescence 
RS Spectral (remote sensing) reflectance 
RT Rate of flow for aiong-track transmittance 

SA Surface chlorophyll a concentration 
SI Incident surface spectral irradiance 
SP Surface phaeophytin concentration 
ST Sea surface temperature 
SU Surface upwelling radiance (extrapolated) 

SW Surface wave height 


TC Transmittance 
TF Aiong-track fluorescence 

TI Aiong-track incident (surface) spectral irradi- 
ance 

TM Total suspended material 

TP Temperature profiles 

TR Spectral solar atmospheric transmission 

TS Aiong-track salinity 

TT Aiong-track sea surface temperature 

VS Spectral volume scattering function 
WD Wind direction 

WR Water-leaving radiance (extrapolated) 

WS Wind speed 
WZ Water column depth 

ZD Downwelled subsurface spectral irradiance 
ZR Upwelled subsurface spectral radiance 
ZU Upwelled subsurface spectral irradiance 

2.2.4 Condensed Type Code 

The condensed data type, T, has two values: 0 for cast 
data (i.e., vertical profiles) and 1 for all other data types 
(e.g., aiong-track, fixed mooring, drifters). For the former, 
NN is interpreted as a sequence number; and for the latter, 
NN is interpreted as an hour indicator, as described below. 

2.2.5 Condensed Sequence Code 

Interpretation of the condensed sequence code, NN, de- 
pends on the type code, T. If T=0, the sequence code is 
applied to cast data and is interpreted as a sequential cast 
number; if T=l, the sequence code is applied to non-cast 
data and is interpreted as a starting time of day indicator. 
For cast data, each column in NN is in the range 0, . . . , Z. 
The value 01 represents the first value in the sequence, 09 
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represents the ninth value, OZ represents the thirty-fifth, 
10 represents the thirty-sixth, etc. A value of 00 is as- 
signed for all sequences greater than or equal to 36x36, or 
1,296. 

For non-cast data, the left-most value in NN represents 
the starting hour indicator, in the range A, . . . , X, where A 
is 0 Greenwich Mean Time (GMT) and X is 23 GMT; the 
right-most value in NN represents the 2-minute indicator 
for data start, in the range 0, . . . , T, where 0 is for minute 
values of 0 and 1 past the hour, 1 is for 2 and 3 minutes 
past the hour, and T is for 58 and 59 minutes past the 
hour. 

2.3 EXPANDED CONVENTION 

The expanded file name uses the 0, . . . , Z character set 
of the condensed file names and adds one more: the hy- 
phen (-) which is used as a sub-field embedded separator. 
The underscore (_) is used for undetermined or unknown 
elements, and the period ( . ) is used to separate the ex- 
panded name into smaller elements. Every expanded file 
name uses 24 characters starting with 1) a seven-character 
time stamp that identifies when the data was taken, 2) fol- 
lowed by a three-character position code identifying where 
the data was (geographically) taken, 3) then an eight- 
character code giving the measurement made, and finally 
4) a three-character sequence identifying the type of data 
and its sequence. The expanded file naming convention is 
as follows: 

YYMMMDD . PPP . I I I I I I I I . TNN 

where, YYMMMDD is a (Gregorian month) time stamp (YY 
is the decadal year with 50 representing 1950 and 49 rep- 
resenting 2049, MMM is an abbreviated month name which 
uses the first three characters of the Gregorian month, and 
DD is the day of the month); PPP is the position code (see 
Section 2.2.2); IIIIIIII is an expanded data identifier de- 
scribed below; T is the type code, and NN is the sequence 
code. (The latter two codes, T and NN, are interpreted as 
described earlier in Sections 2.2.4 and 2.2.5.) Purposely 
there is no definitive file extension associated with the ex- 
panded file name, so the user can append one based on 
local requirements. 

The expanded identifier codes with the corresponding 
verbose descriptions are as follows: 

M0CE01CR MOCET cruise data 
M0CE02CR MOCETI cruise data 
Marra-WJ Marra WOCE/JGOFS data 
BBOP-Opt Siegel BBOP data 
BBOP-Pig Michaels BATS pigment data 
CHORSopt CHORS optical data 
JG0FS-EP JGOFS EqPac data 
NEToptic NET optical data 
NETpigmt NET pigment data 

AirFluor Airborne fluorescence 
AirHumid Air relative humidity 


AirDnlrr Airborne spectral downwelled ir radiance 
Atm-Pres Atmospheric pressure 
AirUpRad Airborne spectral upwelled radiance 
Air-Temp Air temperature 
AT-Fluor Along-track fluorescence 
AT-Salin Along-track salinity 
ATIncIrr Along-track incident (surface) spectral 
irradiance 

ATSf cTem Along-track sea surface temperature 
BeamAten Beam attenuation coefficient 

ChlaFIPr Chlorophyll a fluorescence profiles 
ChlaProf Chlorophyll a profiles 
Co c coCon Coccolith concentration 
ColorDOM Colored dissolved organic matter 
DetAbsrp Detritus absorption coefficient 
DifDnlrr Diffuse attenuation coefficient, down- 
welled irradiance 

DifUpRad Diffuse attenuation coefficient, upwelled 
irradiance 

FulvAcid Fulvic acid concentration 
HumiAcid Humic acid concentration 

InorgSMC Inorganic suspended material concen- 
tration 

LWRadExt Water-leaving radiance (extrapolated) 

OSM-Conc Organic suspended material concentra- 
tion 

Part-Abs Particle absorption coefficient 
ParticFl Particle fluorescence 
Part ic ON Particulate organic nitrogen 
PartSpec Particle size spectra 
PhaeoCon Phaeophytin concentration 
PhotoRad Photosynthetically available radiation 
(PAR) 

Phy coCon Phycobilipigment concentration 
PhytoSpe Phytoplankton species counts 
PigConOt Pigment concentration, other 
PrimProd Primary productivity 

R0F-ATF1 Rate of flow for along-track fluorescence 
ROF-ATTr Rate of flow for along-track transmit- 
tance 

Sal-Prof Salinity profiles 

Sat-Chla Satellite pigment (chlorophyll a) con- 
centration (computed) 

SecDepth Secchi depth 

Sfc-Chla Surface chlorophyll a concentration 
Sfc-Temp Sea surface temperature 
SfcPhaeo Surface phaeophytin concentration 
Sf cSpIrr Incident surface spectral irradiance 
Sf cUpRad Surface upwelling radiance 
(extrapolated) 

SfcWavHt Surface wave height 

SolTrans Spectral solar atmospheric transmission 

Spec-Abs Spectral absorption coefficient 
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SpecAten Spectral beam attenuation coefficient 
SpecBack Spectral backscattering coefficient 
SpecRef 1 Spectral (remote sensing) reflectance 
SpSkyRad Spectral sky radiance distribution 
SubDnlrr Downwelled subsurface spectral irradi- 
ance 

SubUpIrr Upwelled subsurface spectral irradiance 
SubUpRad Upwelled subsurface spectral radiance 

TempProf Temperature profiles 
TotalDOC Total dissolved organic carbon 
TotalDOM Total dissolved organic matter 
Total-SM Total suspended material 
Transmit Transmittance 

Vol-Scat Spectral volume scattering function 

WatDepth Water column depth 
Wind-Dir Wind direction 
Wind-Spd Wind speed 

2.4 ARCHIVING PROCEDURES 

After preparation of the file headers is complete, the 
files are archived to the SeaWiFS calibration and valida- 
tion QStar magneto-optical (MO) jukebox. Science Team 
Members (STMs) and other authorized users will be able to 
query the contents of the archive through a GUI built with 
IDL software and interfaced to the Sybase tables (Fig. 3). 

To access the GUI, the user must provide a valid Sybase 
server account name and password. This provides an ex- 
tra measure of security to supplement the security at the 
operating system level governing the user’s initial entry 
to node cal val ,gsfc .nasa.gov. Once within the GUI, 
the user specifies a particular data set class to query (i.e., 
SIRREX, SBRC, or bio-optical), and particular values for 
one or more of the required keywords for the chosen class 
(where appropriate, a pop-up window provides a list of 
valid values for the selected keywords, taken from the data- 
base) . 

After the user runs the query, the software polls one 
or more database tables in order to construct a list of files 
which meet the stated criteria (these are printed in a sepa- 
rate window as shown in Fig. 3). The user can then select 
one or more of the listed files, in order to either request 
the data (and optionally, the catalog of headers and com- 
ments and external documents from the data provider, if 
applicable) or to view the first few records of the selected 
files in a separate GUI window. The windows containing 
the file names and file excerpts are scrollable in both the 
horizontal and vertical directions. 

In addition to making queries, the users can also revise 
their address, phone number, and other personal data in 
the database, or leave a comment for the system admin- 
istrator. Each request results in a record being entered 
in a special Sybase table. In addition, an electronic mail 
(e-mail) message is sent to the bio-optical system manager 
and a verification request (in the form of a letter or tele- 
phone call, depending on the user’s preference) is sent to 


the user. When the response to the verification request is 
received, it is also logged into the database and the data 
can then be retrieved by the user. Choices for file retrieval 
include network or various types of tape, including 9-track, 

8 mm (EXABYTE), and 4 mm digital audio tape (DAT). 
When network transfers are requested, files will be staged 
from the MO to the user’s account for file transfer protocol 
(ftp) access. With the data, an ASCII look-up table will 
be provided which lists gross characteristics of the data 
(i.e., the same ones covered in the required keyword set). 

Each user needing access to the archive sends an e-mail 
message to seabassOcalval .gsfc .nasa.gov in order to 
receive a database user name and password and further in- 
structions. All of the users outside of GSFC will be placed 
in a common group (both at the UNIX level and the Sybase 
level) to control system and database access. A series of 
administrative tables present in Sybase will track the users, 
their access to the system, their requests for data, and their 
comments on the system (see Table 1). Requests for data 
will be monitored by the administrator of the bio-optical 
data system, who will also be responsible for copying the 
requested data to disk or tape, once verification is received 
from the user. 

The proposed breakdown of tasks and milestones within 
the bio-optical data system is detailed in Table 3. A group 
effort will be required to accomplish all of the necessary 
tasks before launch of the SeaWiFS instrument. E-n. Yeh 
is tasked with handling the initial preparation of most of 
the data that is not derived from SeaWiFS activities in- 
cluding renaming, restructuring if necessary, and the in- 
sertion of header information. T. Westphal and Y. Ge 
performed similar functions for the SeaWiFS data sets, 
including SIRREX activities, MOCE cruises, and MOBY 
deployments. J. Firestone handles applications develop- 
ment, including creation, population, and maintenance of 
the Sybase tables, jukebox applications, and development 
of the user GUI interfacing with the database. 

2.4.1 Archived File Example 

An example data file, BQUINZOW . 001, with a complete 
header block (presented with a truncated /experiment en- 
try to save on space), and an abbreviated data block, is 
presented below: 

/begin_header 

/af f iliations=Lamont-Doherty_Earth_Observatory 
/column_headers=Marra 
/data_f ile_name=bquinzOw. 001 
/data_type=CAST 

/Hnnimentation file name=README , statlog , 
oceanmsgl , PCparbat , dtbat 
/end_date=1991SEP04 
/end_lat itude=-12 .013 [DEG] 

/ end_longitude=-151 . 021 [DEG] 

/end_time=22 : 21 :00 . 000 [GMT] 

/experiment=JG0FS-W0CE_Hydrographic .Program 
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Table 3. Bio-optical data system tasks and milestones. 


Brief Description of Activities 


Database design. 

Implement GUI prototype. 

SIRREX header creation and metadata archiving. 

Header creation and metadata archiving for the CTMSj data sets. 
Develop C routines for file renaming. 

First Bio-optical Algorithm working group meeting. 

MOCE’I master log creation. 

Data system overview and historical pigment database documentation. 
Implement operational GUI. 

MOCE header creation and metadata archiving. 

Second Bio-optical Algorithm Working Group Meeting. 

Archive all data to the jukebox. 

User test period. 

System enhancements. 

SIRREX-3 at CHORS, San Diego. __ 


1993 1994 

0 |N I D J I F I M I A I M I J I J I A I S 


f Data sets developed by D. Clark, C. Trees, J. Marra, and D. Siegel (CTMS) 


SeaWiFS Bia-opt ical Archive and Storage System (SeaBASS) 


SIRREX fields j 

Action Menu 
jchange date class 
^ Start nen query 

v See query syntax 

v Run query (list conp reseed nanes) 

v Run query (list expanded naieee) 

v Uiew a fils 

,y Order f lies /documentation 
v- Revise personal data 
sy Leave a comunt 
v EXIT program 


Select file to view/file (e) to order 
jj93JUR23.B-20in.X--.VRl B 
j j93JUN16.C40in2.X01.ARl (V) 

1 j93JUHlfl.C4Oin2.X01.ARl 


; /begin.header 
/aperture-diameter*6[ in) 

/column-headers*Sequence-», Tines tamp, Mean, St d„0ev,B_ Samples, Del ay [ns], Channel-*, Bain ,Data_Type, iab-Tenp, Detect or- Temp, Ci 
/comeent slni t ial- tests 
/configuration*!. 25-1. 25-1. 25-1. 25 
■ /data- type*Averaged-Uoltages 
/ dates 1 993 JUN18 
/experinent*SIRREX_Z 
i /gain*l 

| /group-identif ier«CM0RS 
j /group-sub-identif ier*40_Inch-Sphere 
j /instrument. identif ier*SeaWiFS-t ransf er.radloMeter 
; / invest igators=0SFC , CHORS, HI ST 
i /lab.tenperature*25[C ] 

j / la«p-cu r rent "bankl -11 . 03( A] , bank2-ll. 00[ A] ^ I 

l? ; ” n zzzz " z zizzziiizizzr: z nzz zz _ 11 iiziziiiiizzizizzzzzzizizz ziziizi 1 ~ ^ ; 

- ■ ■ < 

Choose an item fron Action Menu 


Fig. 3. Prototype GUI used to access information on bio-optical data tiles. 
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/ investigators= John.Marra , Carol_Knudson 
/original_file_name=p9 10904a. dec 
/sequence_munber= 1 
/start _date=1991SEP04 
/start_latitude=-12 . 013 [DEG] 
/start_longitude=-151 . 021 [DEG] 

/start _time=22 : 21 : 00 . 000 [GMT] 

/end_header® 

3.000000 1 . 295326e+02 

4.000000 1.302285e+02 

5.000000 1 . 291834e+02 

193.000000 1 . 355593e+02 

194.000000 1.357831e+02 

195.000000 1.357619e+02 

2.4.2 Condensed File Name Example 

The sample file name BQUINZ0W.001 represents the fol- 
lowing: 

Date BQU = 4 September 1991, 

Position INZ = grid box having its northwest corner at 
(-152.5,-10), 

Data identifier 0W = Marra WOCE/JGOFS data, 
Type code 0 = cast data, and 

Sequence and time code 01 = first cast in the sequence. 

It is worth noting again that a verbose description of 
the file name is included in the keywords: 

/column_headers, 

/data_type, 

/sequence_number, 

/start .date, 

/start .latitude, 

/start.longitude, and 
/start. time. 

2.4.3 Expanded File Name Example 

Using the same sample file, the date stamp of 4 Septem- 
ber 1991 would be coded as 91Sep04. The position code 
remains unchanged, INZ. The data identifier is written out 
with its expanded representation, Marra- WJ. The sequence 
and time code remains unchanged, 001. Hence the full ex- 
panded file name is coded as: 

91Sep04 . INZ . Marra-WJ . 001 . 

The expanded format uses only 24 of the 31 characters 
normally available to many operating systems capable of 
extended file naming. The remaining characters are avail- 
able for use in local applications, perhaps for tracking post- 
processing of the data. 

2.5 KEYWORD GLOSSARY 

The keyword glossary is composed of two sets for bio- 
optical (excluding SIRREX) data files: required and ap- 
proved keywords. The former must appear in every data 


file, while the latter appear only as needed to fully describe 
the contents of the file. Of course, local use of a data file 
permits the addition of locally defined keywords, perhaps 
for tracking post- processing of the data. 

2.5.1 Required Keywords 

The required keywords for the bio-optical data (which 
excludes SIRREX data) are as follows: 

/begin.header This keyword does not have a value and 
must be the first entry in the series of lines comprising 
the header block. 

/af filiations^) This is a comma-delimited list of af- 
filiations for individuals included in /investigators. 
All spaces in a given affiliation should be filled with 
underscores. 

/column_headers=(s) This is a comma-delimited list of 
headers for the columnar data (parameters) in the in- 
put file. The zth entry in the list is associated with 
the zth column. Units for the parameters should be 
included, enclosed in brackets. All spaces in a given 
column header should be filled with underscores. For 
standardization, the individual entries should be the 
short names defined in Section 2.2.3. 

/data.f ile_name=(s) This is the name of the output file 
using the condensed naming convention described in 
Section 2.2. 

/data_type=(s) The allowed data types are: AL0NG.TRACK, 
CAST, DRIFTER, or MOORING. 

/end_dat e=yyyymmmdd The ending date of the observa- 
tions. In this format, yyyy is the year, mmm is the 
abbreviated three-character Gregorian month of the 
year, and dd is the day of the Gregorian month. The 
month abbreviations use the first three characters of 
the month: Jan is January. Feb is February, etc. 
/end_latitude=(n) [u] This is the ending latitudinal posi- 
tion where the data was collected. The allowed values 
are -90 to 90. with degrees north (of the equator) pos- 
itive. 

/end_longitud e=(n)[uj This is the ending longitudinal po- 
sition where the data was collected. The allowed val- 
ues are -180 to 180, with degrees east (of the prime 
meridian) positive. 

/end_time=(s)[u] This is the ending time of the observa- 
tions. The format for (s) is hh:mm:ss.sss , where hh 
is the 24-hour hour of the day. mm is the minute of 
the hour, and ss.sss is the decimal second of the hour. 
The ss.sss field is optional and the [u] field is used 
to denote GMT. L0C, EST. EDT. etc., nominally [GMT]. 
For example, 12 : 05 : 23 . 300 [L0C] indicates 12:05 and 
23.300 seconds local time, while 12 : 05 : 23 . 300 [GMT] 
indicates the time is GMT (i.e.. relative to 0° longi- 
tude). 

/experiment=(s) This is the formal name of the exper- 
iment for which data was collected. All spaces in 
a given experiment name must be filled with under- 
scores. 
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/ invest igators=(s) This is a comma-delimited list of the 
persons who collected the data. All spaces in a given 
investigator name must be filled with underscores. 
/° r iginal_f ile_name=(s) This is the name of the original 
file submitted to the Project, as originally named by 
the investigator(s). 

/ start _dat e=yyyymmmdd The starting date of the obser- 
vations. In this format, yyyy is the year, mmm is the 
abbreviated three-character Gregorian month of the 
year, and dd is the day of the Gregorian month. The 
month abbreviations use the first three characters of 
the month: Jan is January, Feb is February, etc. 
/start_latitude={n) [u] This is the starting latitudinal 
position where the data was collected. The allowed 
values are -90 to 90, with degrees north (of the equa- 
tor) positive. 

/start_longitude=(n)[u] This is the starting longitudinal 
position where the data was collected. The allowed 
values are -180 to 180, with degrees east (of the prime 
meridian) positive. 

/st art _t ime= (s) [u] The starting time of the observations. 
The format for (s) is hh:mm:ss.sss, where hh is the 
24-hour hour of the day, mm is the minute of the 
hour, and ss.sss is the decimal second of the hour. 
The ss.sss field is optional and the [ti] field is used 
to denote GMT, LOC, EST, EDT, etc., nominally [GMT]. 
For example, 12 : 05 : 23 . 300 [LOC] indicates 12:05 and 
23.300 seconds local time, while 12:05 : 23.300 [GMT] 
indicates the time is Greenwich Mean Time (relative 
to 0° longitude). 

/sequence_number j= (n) This is a counter, used only for 
cast data; this is not needed for other data types, 
/end.header® This keyword does not have a value and 
must be the last entry in the series of lines comprising 


the header block. Note the at-sign (0) at the end. 

There is no specific requirement regarding the ordering 
of the keywords in the files, other than the /begin.header 
and / end.header® keywords, which must be first and last 
respectively. It is recommended, however, that the sec- 
ond through next to last keywords be kept alphabetical to 
maximize the readability of the file. 

2.5.2 Approved Keywords 

The additional approved keywords for bio-optical data 
files are as follows: 

/documentation_f ile_name=(s) This is a list, which is 
comma-delimited, of associated documentation files. 
All spaces in a given documentation file name should 
be filled with underscores. 

2.6 DISCUSSION 

The bio-optical data archive contains files stored in an 
ASCII text format that is independent of platform. Each 
file has a header block containing information regarding 
when, where, how, and by whom the data was taken. This 
header also contains information specifying how many rows 
and columns of data to expect. Headers from original data 
files are placed into bio-optical data keywords, when ap- 
propriate. A sequence column is added if necessary Each 
column of data is written in a single numeric format before 
the file is placed in the archive. The result is a file that 
is useable on many platforms and has descriptive header 
information in a known format that is easily parsed for 
automated processing. This header contains the details of 
how the data was obtained, as well as how much and what 
type of data is contained in the file. 
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The Historical Pigment Database 


James K. Firestone 

General Sciences Corporation, Laurel, Maryland 
Charles R. McClain 

NASA Goddard Space Flight Center, Greenbelt, Maryland 


Abstract 

The SeaWiFS calibration and validation historical pigment database has been assembled for evaluating bio- 
optical algorithms, and the quality control of derived products from SeaWiFS and other satellite ocean color 
sensors As of this writing, there are more than 330,000 independent pigment observations in the database, repre- 
senting over 50 data sets amassed between 1956 and 1994 from a wide variety of investigators and oceanographic 
expeditions This document describes the structure of the database, the current set of holdings, the procedures 
for submitting pigment data to the SeaWiFS Project, and the methods for converting data contributions from 
the data format in which they are submitted to the database format. 


3.1 INTRODUCTION 

Figure 4 depicts the historical pigment data in the con- 
text of the overall SeaWiFS bio-optical data system. The 
data arrives in numerous formats, although it is usually 
stored as either ASCII files or as spreadsheets from com- 
mercial software packages. As a first step, plots of the 
geolocation of each station are used to check the validity 
of the latitude and longitude entries. In a few cases, a sta- 
tion may be displayed over land because of an error in the 
coding of latitude or longitude. In these cases, the associ- 
ated records are not imported into the database they are 
rejected. A high-level check is also made on the magnitude 
of the pigment values; if they appear out of range (usually 
impossibly large), the data is not included. 

As additional QC checks are defined by the Bio-Optical 
Algorithms Subgroup (Bio-Optical Algorithms in Fig. 4) of 
the SeaWiFS Science Team (Hooker et al. 1993), they will 
be incorporated into the processing of new data sets and 
will be run retroactively on the existing archive. After the 
QC checks are completed, the data is made compatible 
with the database structure, as discussed in Section 3.3. 
Once the data is loaded into the database, various statis- 
tics and secondary products are derived as required by the 
Bio-Optical Algorithms Subgroup. Again, the Bio-Optical 
Algorithms Subgroup will be advising the CVT on the 
specific products and statistics to store in the database 
(SeaBASS in Fig. 4). The results are made available, on 
request, to members of the Science Team and authorized 
database users, and will be used in the ocean color product 


verification activity within the SeaWiFS Calibration and 
Validation Program (Fig. 4). 

At some future date, the contents of the database may 
also be made publicly available through the DAAC. Cur- 
rently, the data can be easily listed and displayed through 
the JGOFS object-oriented data system, as described in 
Section 3.5. In Fig. 4, everything within the dashed box 
is the responsibility of CVT personnel. The CVT also 
participates in the Bio-Optical Algorithm Subgroup of the 
SeaWiFS Science Team. 

3.2 PIGMENT DATABASES 

The CVT is maintaining four separate data sets con- 
taining the entire historical pigment database. Two of the 
data sets are resident in the Microsoft FoxPro environment 
on a 486 PC system, and are stored as standard . DBF files. 
These files are, therefore, accessible from other PC data- 
base packages, such as Borland’s dBase IV. One of the 
PC data sets contains only near-surface pigment values at 
depths between 0-10 m, while a second contains pigment 
profiles, i.e., no depths excluded. There is an equivalent set 
of two data sets, with identical structures and data to the 
PC data sets, but stored under the Sybase relational data- 
base management system (RDBMS) running on a Silicon 
Graphics, Inc. (SGI) UNIX workstation. Table 4 shows 
the database schema for the four historical pigment data 
sets and sample records. 
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Fig. 4. The historical pigment data in the context of the overall data system. 


3.3 DATA PROCESSING 

The goal of this activity is to convert the data received 
from outside investigators to a standard form directly im- 
portable into the four databases described above. Since 
the data comes to the Project in various physical forms 
(generally ASCII or spreadsheet) and with different inter- 
nal structures, considerable effort is sometimes needed to 
restructure it. 

The most common problems are encountered when at- 
tempts are made to read vastly different file formats and 
data structures. These problems are as follows: 

1. Physical structure differences: typically, the da- 
ta may be in either an ASCII file or a spread- 
sheet file. Spreadsheets may be output from dif- 
ferent software packages, such as Lotus 1-2-3 or 
Microsoft Excel. 

2. Position representation discrepancies: dissimi- 
larities here include sign conventions, primarily 
in longitude, indicating the hemisphere where 
the data is located; use of directional indicators, 
such as N for north and W for west to replace 
signs, and use of degree/minute/second repre- 
sentations versus the use of decimal degrees. 

3. Non-standard dates and times: for date, some 
investigators have chosen to use the sequential 
day of the year while others have chosen the Gre- 
gorian month and day. The month has been ex- 
pressed as abbreviations (e.g.. Jan for January) 
or numerically (e.g.. 1 for January and 12 for De- 
cember). The time may be expressed as GMT 


Table 4. Database schema for the historical pig- 
ment data sets. In the Format column: x. a, and / 
indicate an integer, character, and a real field, re- 
spectively; the numbers indicate the width of the 

.rJ M ~ 1 1 


1 * I ll lllisaill^. 

Pn ^ u ^ m ^ tter s nam e, data location, or original file name. 
L3J Specify -9999 if missing. 

£D Units of mgm -3 . 

CD Derived from non-HPLC sources, e.g., Turner method. 
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or as local time, and often there is no indication 
of which one is being used. Hour and minute 
values may be colon-separated or stored in sep- 
arate columns in the file. 

4. Field ordering: each investigator uses his or her 
own method for writing the data files. There- 
fore, it is generally necessary to reorganize, and 
in some cases add or remove, columns from the 
data so it conforms with the established output 
standard. 

5. Missing data: if a data value is missing for a par- 
ticular column, it may be left blank or filled with 
almost any invalid data value as a replacement. 

When left blank, data reshuffling is made more 
difficult because most spreadsheets will move 
leftward all columns to the right of the missing 
columns when data is imported. 

6. Field delimiter differences: various types of field 
delimiters have been used, most commonly tabs, 
spaces, and commas. 

Typically, one of three approaches has been used in 
generating a tab-delimited version of the fields listed in 
Table 4 (tab- delimited ASCII files can be easily imported 
into COTS programs). The three approaches are as fol- 
lows: a) use the Freeform software package provided free 
of charge by the National Oceanic and Atmospheric Ad- 
ministration (NOAA) National Geophysical Data Center 
(Habermann 1991), b) use a commercial spreadsheet such 
as Lotus 1-2-3 or Microsoft Excel, or c) write a customized 
program, to reorder the columns or compute standardized 
representations for spatial and temporal parameters. 

The Freeform package, the first approach, allows spec- 
ification of both the input and desired output file formats 
in an ASCII text file, and eliminates the need for writing 
a custom program to unpack each unique input format. It 
also supports conversions between different representations 
for space and time. However, this approach works only if 
the input data is in a fixed set of columns and never varies, 
since the start and end positions of each column must be 
specified. In addition, headers can be read only if they are 
found at the beginning of the file. Nevertheless, Freeform 
has been useful for processing many of the more straight- 
forward data sets. 

Commercial spreadsheets, the second approach, are ex- 
tremely useful when the input data are not in uniform 
columns, but the columns are delimited in some fashion. 
However, as described above, they have difficulty aligning 
data when blanks have been used as the missing value. 

Customized programs, the third approach, are a last 
resort and the most time consuming, since testing of the 
code is involved for each input format. This approach is 
sometimes the only choice when the input cannot be pro- 
cessed with either of the other approaches. For example, 
headers may be interleaved with the data. This sometimes 
happens with station data where the header gives the lo- 
cation, time, and perhaps cruise information. 
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The output of any of the discussed approaches is a tab- 
delimited ASCII file in the format of Table 4. This file 
is then easily imported into FoxPro (using the Append 
Fhom. . . menu option) or Sybase (using the bulk-copy, or 
bcp, utility). 

3.4 DATA HOLDINGS 

Table 5 is a list of the current set of data holdings in 
the historical pigment archive. Column 2 lists the institu- 
tional, or individual, source of the data (entries ingested 
since the fall of 1993 are followed by a t symbol). Column 3 
indicates the name of the provider’s original file, in either 
ASCII or spreadsheet format. Column 4 indicates the gen- 
eral location of the data set. Columns 5 and 6 represent the 
number of unique spatial points (i.e., one per unique loca- 
tion, even over multiple times and depths at that location), 
broken down by data source. The values in column 5 are 
computed independently for each data source, without re- 
gard to whether the data are present elsewhere in the other 
data sets. The values in column 6 are computed over the 
entire archive, then broken down by source, so that no du- 
plicates are found between data sets. In cases where a data 
source contains data found elsewhere in the archive, e.g., 
W. Balcli (University of Miami/Rosenstiel School for Ma- 
rine and Atmospheric Sciences data) , column 6 values w ill 
be smaller than those in column 5. Both columns 5 and 
6 will be much smaller than the total number of observa- 
tions in a data set when the data set is largely composed 
of a time series (e.g., the JGOFS North Atlantic bloom 
data is a time series, so only one unique location is listed 
in columns 5 and 6). 

Figure 5 is a depiction of all data locations in the his- 
torical pigment archive plotted on a world map. A pro- 
gram called PLOTPNT within the UNIX version of the SEA- 
PAK (McClain et al. 1991a and 1991b) software system 
was used to plot the locations, using the latitude and lon- 
gitude columns in each data file as input. The area least 
sampled continues to be the southern ocean, particularly 
the Atlantic and Indian Ocean portions. 

3.5 ACCESSING THE ARCHIVE 

In order to make the pigment archive readily available 
to the scientific community, the CVT has provided the data 
to the JGOFS Distributed Object Oriented Data System. 
This system, developed and overseen by G. Flierl of the 
Massachusetts Institute of Technology (MIT), D. Glover 
of Woods Hole Oceanographic Institution (WHOI), and J. 
Bishop and S. Paranjpe of Lamont-Doherty Earth Obser- 
vatory (LDEO), makes widely scattered data sets avail- 
able to anyone having the JGOFS server installed on their 
Sun Microsystems (Sun), Digital Equipment Corporation 
(DEC), SGI, International Business Machines (IBM) or 
similar UNIX workstations. With the software, data ob- 
jects can be easily listed or displayed in a variety of ways, 
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Table 5. Historical pigment data sets in the archive. Column 5 indicates the number of unique spatial points 
for each data source. Column 6 indicates the number of unique spatial points over the entire data set for each 
source. 


Entry- 

No. 

Data 

Source 

Original 
File Name 

Geolocation 

Description 

Unique Stations in 
Source Database 

1 

French 

7906F.WK1 

Equatorial Atlantic 

29 

29 

2 

French 

7910F.WK1 

Equatorial Atlantic 

49 

49 

3 

French 

7912F.WK1 

Equatorial Atlantic 

39 

39 

4 

Flench 

7992F.WK1 

Equatorial Atlantic 

9 

9 

5 

French 

7996F.WK1 

Equatorial Atlantic 

9 

9 

6 

French 

8001F.WK1 

Equatorial Atlantic 

17 

17 

7 

French 

8201F.WK1 

Equatorial Atlantic 

46 

46 

8 

French 

8301F.WK1 

Equatorial Atlantic 

68 

68 

9 

French 

8303F.WK1 

Equatorial Atlantic 

92 

92 

10 

French 

8401F.WK1 

Equatorial Atlantic 

94 

94 

11 

French 

8403F.WK1 

Equatorial Atlantic 

99 

99 

12 

French 

8601F.WK1 

Equatorial Atlantic 

136 

136 

13 

Dandonneau 

Tri81 

Equatorial Pacific 

1,898 

1,898 

14 

Dandonneau 

Tri82 

Equatorial Pacific 

4,464 

4,464 

15 

Dandonneau 

Tri83 

Equatorial Pacific 

4,739 

4,739 

16 

Dandonneau 

Tri84 

Equatorial Pacific 

3,505 

3,505 

17 

Dandonneau 

Tri85 

Equatorial Pacific 

3,773 

3,773 

18 

Dandonneau 

Tri86 

Equatorial Pacific 

3,333 

3,333 

19 

Chavez 

CHAVEZ . TXT 

Equatorial Pacific 

500 

500 

20 

JARE§ 

J74_75.TXT 

Pacific 

176 

176 

21 

JARE 

J75_76.TXT 

Pacific 

113 

113 

22 

JARE 

J76_77.TXT 

Pacific 

150 

150 

23 

JARE 

J77_78.TXT 

Pacific 

143 

143 

24 

JARE 

J78_79.TXT 

Pacific 

241 

241 

25 

JARE 

J79_80.TXT 

Pacific 

216 

216 

26 

JARE 

J80_81.TXT 

Pacific 

162 

162 

27 

JARE 

J81_82.TXT 

Pacific 

135 

135 

28 

JARE 

DATA7-27.PRN 

Pacific 

1,496 

1,496 

29 

Oregon State U. 

0SU* . ASC 

Pacific 

619 

619 

30 

Russian/Sushkina 

RUSSBI0L . ASC 

Pacific 

126 

126 

31 

Balchf 

♦ .DAT 

Northern Hemisphere 

8,669 

7,946 

32 

Yentsch 

YENTSCH . TXT 

North Atlantic 

2,460 

2,460 

33 

K ono valov- B e 1 j ae va 

RUSSIAN . WK1 

Atlantic 

120 

120 

34 

JGOFS 

JGOFS . ASC 

North Atlantic Bloom 

1 

1 

35 

Atkinson 

BLMPIG.WK1 

South Atlantic Bight 

17 

17 

36 

Yoder 

PREGABPG . WK1 

South Atlantic Bight 

15 

15 

37 

Yoder 

GABEAPIG . WK1 

South Atlantic Bight 

21 

21 

38 

Yoder 

GABISLPG . WK1 

South Atlantic Bight 

12 

12 

39 

Yoder 

GULFPIG . WK1 

Gulf of Mexico 

20 

20 

40 

Northeast Fisheries 

MARMAP . TXT 

Mid- Atlantic Bight 

750 

750 

41 

Muller-Karger 

VENIZPIG . WK1 

Caribbean Sea 

21 

21 

42 

Bar ale 

ADRIAPIG . WK1 

Adriatic Sea 

31 

31 

43 

Russian 

RUSS22AV. ASC 

Indian Ocean 

158 

158 

44 

Russian 

RUSS08PV . ASC 

Indian Ocean 

109 

109 

45 

Wright- Australia 

AAMBER2P . RYD 

Southern Ocean 

46 

46 

46 

Wright-Australia 

ADBEXN0V. 198 

Southern Ocean 

34 

34 

47 

Wright- Australia 

AAMBERTR . ANS 

Southern Ocean 

36 

36 

48 

Wright-Australia 

HEARDISJ.UNE 

Southern Ocean 

61 

61 

49 

Wright-Australia 

SIBEX2PR. YDZ 

Southern Ocean 

46 

45 


§ Japanese Antarctic Research Expedition * File name wildcard specifier. f Recent update to the database. 
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Table 5. (cont.) Historical pigment data sets in the archive. Column 5 indicates the number of unique spatial 
points for each data source. Column 6 indicates the number of unique spatial points over the entire data set for 
each source. 


Entry 

No. 

Data 

Source 

Original 
File Name 

Geolocation 

Description 

Unique Stations in 
Source Database 

50 

Wright- Australia 

WOCE 1 -OCT. 19 

Southern Ocean 

27 

27 

51 

Dieckmann / Germany 

WEDDELL* . TXT 

Weddell Sea 

82 

74 

52 

Dieckmann /Germany 

ANTV_* . WK3 

Weddell Sea 

57 

57 

53 

Sullivan 

AM123CHL . WKS 

Weddell Sea 

244 

244 

54 

El-Sayed 

ELS AY* . WK1 

Southern Ocean 

397 

397 

55 

British 

BIOMASS*. TXT 

Southern Ocean 

474 

474 

56 

New Zealand 

KIWI1978.TXT 

Southern Ocean 

52 

52 

57 

Sakshaug/Holm-Hansen 

P0LSEAII . ASC 

Southern Ocean 

207 

207 

58 

Longhurstf 

BEDFORD . TXT 

North Atlantic 

1,885 

1,885 

59 

Biggst 

BIGGSPR0 . FIL 

Southern Ocean 

34 

34 

60 

Clark Along-Track| 

*.TLK 

United States Coastal 

8,566 

8,566 

61 

Clark Stations! 

*_N0AA_* . DAT 

United States Coastal 

62 

62 

62 

Mitchellf 

R2CHL.ASC 

Southern Ocean 

345 

69 

63 

JGOFS EqPacf 

PIGT+.PRN 

Equatorial Pacific 

154 

154 

64 

Marra JGOFSf 

P16CHL . ASC 

Equatorial Pacific 

25 

25 

65 

Siegel BBOPj 

E-mail messages Bermuda 

28 

28 

66 

Michaels BATSf 

PIG*. DAT 

Bermuda 

17 

17 

Total Number of Stations 

51,759 

50,751 


| Recent update to the database. * File name wildcard specifier. Current as of June 1994. 


and certain derived quantities can also be included in the 
output. J. Bishop of LDEO has written a method (data 
reading program) to accompany the data. Together the 
method and the data form a data object which is accessi- 
ble from the JGOFS software. The GSFC pigment data is 
found in the following object: 

name : :pigsn(/geochem/solar/data/pigs/pigsf c . asc) 

where the internet protocol (IP) address of the system name 
is 129.236.40.87. The CVT intends to provide periodic 
updates of the archive to the JGOFS system. For a com- 
plete description of the design, installation procedures and 
use of the JGOFS software, consult Flierl et al. (1993). A 
capsule summary of the installation procedure taken from 
this document is found in Section 3.7. The contents of 
Flierl et al. (1993) can be found in the doc subdirectory 
(after installation) as a series of .ps (PostScript) or .tex 
(Tj^X.) files for hardcopy output. 

3.6 SUBMISSION GUIDELINES 

The CVT encourages submission of historical pigment 
data sets collected from specialized field expeditions or as 
part of coordinated global programs such as JGOFS or 
WOCE. The general guidelines for data submission are 
detailed in Chapter 1. In addition, there are two additional 
guidelines which apply specifically to the pigment data. 

1. Record format: It is particularly important to 
fill each column for the data records, even if data 


is missing. The use of blanks for missing data 
can make it more difficult to import the data 
into the database. In lieu of blanks for missing 
data, a code of -9999 must be used. Also, the 
content of data headers (e.g., station names, lo- 
cations, times, etc.), when possible, should be 
converted to the columnar representation and 
included with the data rather than remaining 
text entries. 

2. Extra characters : Do not include extra char- 
acters in the data, such as boolean operators. 
There is currently no provision to store this level 
of detail in the database. 

3.7 JGOFS SOFTWARE 

The procedures to install the JGOFS software are as 
follows: 

1. Set up a jgofs account on a local machine to 
store the software. 

2. Log in to the local jgofs account and telnet to 
IP address pimms . mit . edu (18.83.0. 104). 

3. Log in as the account jgofs, using the password 
object$data. The system asks for your ma- 
chine name, possibly the IP address, and user- 
name (the name under which you would like to 
receive mail). 

4. Select option 1 (transfer JGOFS software). The 
system executes an ftp command back to your 
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machine. Press a return if your local user name 
is jgofs; otherwise, enter the user name. Then 
enter the password. 

5. Copy the software to your local machine by typ- 
ing put jgofs. tar and then quit. 

6. Exit from the control menu by typing 0. 

7. On your local system, unpack the tar file by typ- 
ing tar -xvf jgofs. tar. 

8. Edit the file called Makefile. If you are not 
on a Sun workstation, comment out the lines 
for the Sun and activate the lines for your own 
system (a # sign at the start of a line indicates 
a comment). 

9. Type setenv PWD r pwd‘. 

10. Type make. 

11. If this works, type bin/jgof s_serv & to start 
the server. 

12. Then, initialize the system for plotting by typing 
source penplot. 

13. Start the menuing system by typing the com- 
mand source ” jgof s/jgof s. A three-window 
display, with windows labeled xmenu, list, and 


jgofs should pop up. 

14. Click in the bottom portion of the window enti- 
tled xmenu. 

15. Test the software by typing list "#test". A 
printout of hydrographic data retrieved from a 
local data object, which comes with the instal- 
lation, should appear. Type q to quit. 

16. Check the listing of remote objects by typing 
list "#tco2". 

17. If the menu system does not build properly, the 
local machine may be missing portions of the 
full X library. In this case, Glenn Flierl should 
be contacted at either glennQlake .mit . edu or 
glennQpimms .mit . edu for advice. 

18. Normally, the JGOFS server, once started, will 
run until the next system reboot. If the system 
reboots, the user will need to manually start 
the JGOFS server with step 11 above (or you 
can add it to your system startup procedure in 
/etc/rc .local. To stop the server at any time, 
type source bin/stopserver from your local 
jgofs account. 


22 


Hooker, McClain, Firestone, Westphal, Yeh, and Ge 


Chapter 4 


The SIRREX Database 

Stanford B. Hooker 

NASA Goddard Space Flight Center, Greenbelt , Maryland 
Todd L. Westphal 

General Sciences Corporation , Laurel , Maryland 
Yuntao Ge 

University Space Research Association, Columbia, Maryland 
Abstract 

This document describes the procedures used in obtaining and archiving the SIRREX data sets. These data 
sets provide a calibration history for the various lamps, spheres, plaques, and radiometers used as a radiometric 
standard for field measurements. Currently, there are two SIRREX data sets in the archive. This document 
contains the file naming and header protocol used for SIRREX data, the procedure for archiving the original 
data, and an example of an archived file. 


4.1 INTRODUCTION 

In keeping with the SeaB ASS file architecture, SIRREX 
data utilizes a condensed and expanded file naming con- 
vention. The former makes use of the so-called eight-point- 
three 11-character format designed for PCs running MS- 
DOS (the most restrictive naming convention commonly in 
use); the latter is for any computer supporting 31-character 
file names. All files in the database are stored in the con- 
densed format, but users can request data in the expanded 
format which is more readily deciphered. 

4.2 CONDENSED CONVENTION 

Every condensed file name starts with a three-character 
time stamp that identifies when the data was taken, fol- 
lowed by a five- character primary test code identifying 
what was measured and with what instrumentation, and 
then a three-character extension identifying the type of 
data recorded. The condensed file naming convention is as 
follows: 

DDDPPPPP . eee 

where DDD is the date stamp, PPPPP is the primary test 
code, and eee is the file extension. The date stamp is com- 
posed of three characters: YMD where. Y is the year code, M 
is the month code, and D is the day of the month code. The 
primary test code is composed of three sub-codes: SSICC 
where, SS is a two-character source identifier, I is a one- 
character measurement instrument identifier, and CC is a 


two-character configuration identifier. The file extension is 
also further divided into three codes: drs where, d is a one- 
character data type, r is a one-character run type, and s is 
a one-character sequential run character (frequently a nu- 
meric sequence). Thus, the complete condensed file name 
convention is: 

YMDSSICC . drs 

where, once again, the condensed convention makes full 
use of the eight- point- three 11-character format designed 
for MS-DOS PCs. 

4.2.1 Condensed Date Code 

The condensed time code is a three-character code, YMD, 
and is decoded as follows: Y has the allowed values 1, . . . , Z, 
where 1 = 1991, 2=1992, 3=1993, A=2000, and Z=2025; 
M has the allowed values 1, 2, 3, . . . , A, B, C, where l=Jan., 
2=Feb., 3= Mar., A = Oct., B=Nov., and C=Dec.; and D has 
the allowed values 1, 2, 3, . . . , A,B, . . . , V, where 1 = 1, 2=2, 
3-3, A=10, B=1 1, and V=31. If any part of the date field 
is not known, the underscore (_) is used for the unknown 
component. 

4.2.2 Source Identifier Code 

1 he source identifier code is a two-character code which 
identifies what radiance source was employed, usually a 
sphere, lamp, or plaque. Sources are associated with spe- 
cific organizations or institutions, so it is typically made up 
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of a group main designator, G, and a group sub-designator, 
g, both of which have the allowed values 0, . . . , Z. Group 
codes are usually one of the following: 

A for UA, 

B for BSI, 

C for CHORS, 

D for SIO, 

E for SRT, 

G for GSFC, 

H for SBRC, 

I for NIST, 

J for NRIFSF (Japan), 

M for UM, 

N for NOAA/MLML, 

R for NRaD, 

S for Satlantic, 

U for UCSB, and 
W for WFF. 

Group sub-designators are separated into numbers and let- 
ters with numbers indicating spheres and plaques, and let- 
ters indicating lamps. Once combined, the common Gg 
codes are as follows: 

A1 for the UA 10 in Spectralon plaque, 

A A for FEL lamp F296, 

AB for FEL lamp F297, 

B1 for the BSI 10 in Spectralon plaque, 

B2 for the BSI 20 in sphere, 

B3 for the BSI 18 in Spectralon plaque, 

BA for FEL lamp F310, 

BB for FEL lamp F321, 

BC for FEL lamp 91453, 

BD for FEL lamp 91537, 

Cl for the CHORS 40 in sphere with 150 W lamps, 

C2 for the CHORS 40 in sphere with 275 W lamps, 

C3 for the CHORS 18 in Spectralon plaque, 

CA for FEL lamp 90572, 

CB for FEL lamp 91348, 

CC for FEL lamp 91534, 

CD for FEL lamp 91533. 

CE for FEL lamp 91349, 

DA for FEL lamp F12L, 

G1 for the GSFC 42 in sphere, 

GA for FEL lamp F268, 

GB for FEL lamp F269. 

GC for FEL lamp F227. 

GD for FEL lamp F315. 

GE for FEL lamp F316. 

HI for the SBRC 20 in sphere, 

HA for DXW lamp 19, 

HB for DXW lamp SS27, 

HW for the SBRC SeaWiFS instrument, 

JA for FEL lamp F123. 

MA for FEL lamp F12G, 

MB for FEL lamp F12H. 


N1 for the NOAA 10 in plaque (S/N 001900), 

N2 for the NOAA Gamma RS-10A (S/N HL1315), 

N3 for the NOAA Gamma RS-10A (S/N HL382), 

N4 for the NOAA Optronics 420M (S/N 92106057), 

N5 for the NOAA Gamma RS-13 (S/N HL411), 

NA for FEL lamp F307, 

NB for FEL lamp F308, 

RA for FEL lamp F265, 

SI for the Satlantic 18 in Spectralon plaque, 

SA for DXW lamp S724, 

SB for DXW lamp S721, 

SC for FEL lamp 91604, 

SD for FEL lamp 91591, 

SE for FEL lamp 91615, 

U1 for the UCSB 20 in sphere, 

U2 for the UCSB 18 in Spectralon plaque, 

U3 for the UCSB 10 in Spectralon plaque, 

UA for FEL lamp F219, 

UB for FEL lamp F393, 

UC for FEL lamp F303, 

UD for FEL lamp F304, 

UE for FEL lamp F305, 

W1 for the WFF 30 in sphere. 

Note, the HW source identifier code is mentioned here, but 
it is reserved for the SBRC database as described in Chap- 
ter 5. 

4.2.3 Instrument Identifier Code 

The instrument identifier code, I, is a one-character 
code which identifies the instrument used to make the mea- 
surement. It has the allowed values 0, . . . , Z, but is gener- 
ally one of the following: 

1 for the NIST absolute radiometer number 1, 

2 for the NIST absolute radiometer number 2, 

8 for the Satlantic 7-channel radiometer (S/N 8), 

A for the UA (EOS) 7-channel transfer radiometer, 

B for the BSI 6-channel transfer radiometer, 

C for the CHORS 11-channel transfer radiometer, 

I for the GSFC Optronics 746 JSIC, 

P for the NIST PR714 radiometer, 

S for the Satlantic 7-channel radiometer (S/N 26), 

T for the GSFC Optronics 746_T, and 
X for the SeaWiFS 6-channel transfer radiometer 
(SXR). 

4.2.4 Configuration Code 

The configuration code, CC, is a two-character code 
which identifies the configuration of the source and instru- 
ment during the measurement process. CC has the allowed 
values 0 ,. . . ,Z for each character position. Unknown or 
undetermined configurations are coded as . The config- 

uration codes for the various sources are as follows: 
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UA Win Spectralon Plaque A1 
11 viewed at approximately 45° and ~150cm. 

BSI Win Spectralon Plaque B1 
11 viewed at approximately 45° and ~150cm. 

BSI 20 in Sphere B2 

00 aperture closed, 

01 aperture in position 1, 

02 aperture in position 2, 

06 aperture in position 6. 

BSI 18 in Spectralon Plaque B3 
11 viewed at approximately 45° and ~150cm. 

CHORS Spheres Cl and C2 

00 0.000-0.000-0.000-0.000 (all apertures closed), 

01 1.250-1.250-1.250-1.250, 

02 0.875-0.875-1.080-1.080, 

03 0.628-0.628-0.766-0.766, 

04 0.438-0.438-0.531-0.531, 

05 0.312-0.312-0.438-0.438, 

06 0.000-0.312-0.000-0.438, and 

07 0.312-0.000-0.438-0.000. 

CHORS 18 in Spectralon Plaque C3 
11 viewed at approximately 45° and 150 cm. 

GSFC Sphere G1 

00 all lamps off, 

01 1 lamp on, 

02 2 lamps on, 

16 16 lamps on. 

SBRC Sphere HI 

AA 6 5 W lamps and 2 4 W lamps on, 

AB 1 45 W lamp on, 

AC 1 45 and 1 200 W lamp on, 

AD 6 5, 2 45 and 1 200 W lamp on, 

00 all lamps off, 

01 1 200 W lamp on, 

02 2 200 W lamps on, 

11 11 200 W lamps on. 

Lamp Calibrations 

Cl CHORS baffles with the lamp at ~ 100 cm, 

C5 CHORS baffles with the lamp at ~50cm, 

G5 GSFC baffles with the lamp at ~50cm, and 
15 NIST baffles with the lamp at ~50cm. 

NOAA Win Spectralon Plaque N1 
11 viewed at approximately 45° and ~ 150 cm. 

NOAA Gamma RS-10A (S/N HL1315) N2 
W0 without diffuser attachment, and 
DA with diffuser attachment RS-70-1. 


NOAA Gamma RS-10A (S/N HL382) N3 
W0 without diffuser attachment, and 
DA with diffuser attachment RS-70-1. 

NOAA Optronics 420M N4 

01 slide=3, wheel=5, and distance ^100, and 

02 slide=3, wheel=6, and distance ~40. 

NOAA Gamma RS-13 (S/N HL411) N5 
A2 aperture wheel=200, 

A3 aperture wheel =300, 

A4 aperture wheel =400, and 
A9 aperture wheel=900. 

Sat Ian tic 18 in Spectralon Plaque SI 
11 viewed at approximately 45° and 150 cm. 

UCSB 20 in Sphere U1 

UA FEL lamp F219 used at approximately 50 cm 
and 

UB FEL lamp F393 used at approximately 50 cm. 

(If a lamp other than a UCSB lamp is used, its 
two-character code is entered instead.) 

UCSB 18 in Spectralon Plaque U2 

11 viewed at approximately 45° and -150 cm. 

UCSB Win Spectralon Plaque U3 
11 viewed at approximately 45° and 150 cm. 

WFF 30 in Sphere W1 

00 apertures 1 and 2 open (no aperture cards in- 
serted), 

01 aperture 1 open and aperture 2 in position 1. 

06 aperture 1 open and aperture 2 in position 6, 

10 aperture 1 in position 1 and aperture 2 open, 

11 aperture 1 in position 1 and aperture 2 in posi- 
tion 1, 

66 aperture 1 in position 6 and aperture 2 in posi- 
tion 6. 

4.2.5 Data Type 

The data type, d, is a one-character code which iden- 
tifies the t^pe of data collected during the measurement 
process and has the allowed values A Z. but is gener- 

ally one of the following: 

A for averaged voltages, 

C for (responsivity) calibration file. 

D for derived file, 

E for irradiance, 

H for historical file. 

L for radiance, 

T for (text) documentation file. 

V for voltage, and 
- for undetermined or unknown. 
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4.2.6 Run Type 

The run type, r, is a one-character code which identifies 
the type of run executed during the measurement process 
and has the allowed values 0, . . . , Z, but is generally one of 
the following: 

C for sphere radiance calibration using a cosine 
collector, 

I for sphere irradiance, 

L for lamp irradiance calibration, 

M for lamp to lamp ratios, 

P for plaque radiance instrument calibration, 

Q for sphere responsivity, 

R for sphere radiance calibration using a calibrated 
radiometer, 

S for sphere radiance instrument calibration, 

X for system response file, and 
for undetermined or unknown. 

4.2-7 Sequence Character 

The sequence character, s, identifies which run in a 
series of runs was executed during the measurement pro- 
cess and has the allowed values 0, . . . , Z; the underscore 
(_) is reserved to denote inapplicable, undetermined, or 
unknown entries. 

4.3 EXPANDED CONVENTION 

The expanded file name uses the 0, . . . , Z character set 
of the condensed file names and adds one more: the hy- 
phen (-) which is used as a sub-field embedded separator. 
The underscore (_) is used for undetermined or unknown 
elements and the period ( . ) is used to separate the ex- 
panded name into smaller elements. Every expanded file 
name uses 22 characters starting with 1) a seven-character 
time stamp that identifies when the data was taken, fol- 
lowed by 2) a six-character primary test code identifying 
what was measured, then 3) a three-character code giv- 
ing the instrumentation used and its configuration during 
data collection, and finally 4) a three-character sequence 
identifying the type of data. The expanded file naming 
convention is as follows: 

YYMMMDD . GEEEEE . ICC . drs, 

where, YYMMMDD is a Gregorian time stamp (YY is the deca- 
dal year with: 91 representing 1991, 92 representing 1992, 
00 representing 2000, and 25 representing 2025, MMM is an 
abbreviated month name which uses the first three char- 
acters of the month, and DD is the day of the month); G 
is the group code (described earlier in the Source Identi- 
fier Code section of the condensed file naming convention); 
EEEEE is an expanded source identifier described below; I 
is the Instrument Identifier Code ; CC is the Configuration 
Code; d is the Data Type; r is the Run Type; and s is the 
Sequence Character. Note that YY covers only the range 
1991-2025, for consistency with the condensed SIRREX 


convention described in Section 4.2. Purposely there is no 
definitive file extension on the expanded file name, so the 
user can append one based on local requirements. 

The complete expanded source identification code is 
made up of the group code G and an expanded source name 
EEEEE. The hyphen is used to ensure a dense five-character 
source name. The common GEEEEE codes are as follows: 

A-lOin for UA 10 in Spectralon plaque, 

A-F296 for FEL lamp F296, 

A-F297 for FEL lamp F297, 

B-lOin for BSI 10 in Spectralon plaque, 

B-18in for BSI 18 in sphere, 

B-20in for BSI 20 in sphere, 

B-F310 for FEL lamp F310, 

B-F321 for FEL lamp F321, 

B91453 for FEL lamp 91453, 

B91537 for FEL lamp 91537, 

C-18in for CHORS 18 in Spectralon plaque, 

C40inl for CHORS 40 in sphere w/150 W lamps, 
C40in2 for CHORS 40 in sphere w/275 W lamps, 
C90572 for FEL lamp 90572, 

C91348 for FEL lamp 91348, 

C91349 for FEL lamp 91349, 

C91533 for FEL lamp 91533, 

C91534 for FEL lamp 91534, 

D-F12L for FEL lamp F12L, 

G-42in for GSFC 42 in sphere, 

G-F227 for FEL lamp F227, 

G-F268 for FEL lamp F268, 

G-F269 for FEL lamp F269, 

G-F315 for FEL lamp F315, 

G-F316 for FEL lamp F316, 

H-20in for SBRC 20 in sphere, 

H-SS27 for DXW lamp SS27, 

HDXW19 for DXW lamp 19, 

HSWiFS for SeaWiFS instrument, 

J-F123 for FEL lamp F123, 

M-F12G for FEL lamp F12G, 

M-F12H for FEL lamp F12H. 

N-lOin for NOAA 10 in plaque (S/N 001900), 

N-420M for NOAA Optronics 420M (S/N 92106057), 

N-F307 for FEL lamp F307, 

N-F308 for FEL lamp F308, 

NG1315 for NOAA Gamma RS-10A (S/N HL1315), 
N-G382 for NOAA Gamma RS-10A (S/N HL382), 
N-G411 for NOAA Gamma RS-13 (S/N HL411), 
R-F265 for FEL lamp F265. 

S-18in for Satlantic 18 in Spectralon plaque, 

S-S721 for DXW lamp S721, 

S-S724 for DXW lamp S724. 

S91591 for FEL lamp 91591. 

S91604 for FEL lamp 91604. 

S91615 for FEL lamp 91615. 
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U-lOin for UCSB 10 in sphere, 

U-18in for UCSB 18 in sphere, 

U-20in for UCSB 20 in sphere, 

U-F219 for FEL lamp F219, 

U-F303 for FEL lamp F303, 

U-F304 for FEL lamp F304, 

U-F305 for FEL lamp F305, 

U-F393 for FEL lamp F393, and 
W-30in for WFF 30 in sphere. 

Note again, the HSWiFS source identifier code is mentioned 
here, but it is reserved for the SBRC database as described 
in Chapter 5. 

4.4 ARCHIVING PROCEDURES 

The SIRREX protocol specifies a header format which 
is inserted at the beginning of every file before it is archived. 
Any relevant information in an existing header is incorpo- 
rated into the SIRREX keyword vocabulary. Additional 
notations are incorporated as comment lines, which be- 
gin with an exclamation point (!). The requirement for 
tab-delimited text and a sequence column often requires 
modification of the original data. While the data is be- 
ing modified, a consistent numeric format is used for each 
column of data. This format is chosen based on the range 
and precision of the data taken by a particular instrument. 
Using a consistent numeric format allows a quick visual in- 
spection of the data to locate errors. 

Investigators participating in a SIRREX are encour- 
aged to provide data in a form as close to the established 
protocol as is feasible. Many radiometers brought to SIR- 
REXs use commercial software that creates output in a 
specific format, which mandates modifying the data be- 
fore the file can be archived. Each instrument has its own 
log sheet which is used to record all the relevant informa- 
tion during each measurement. These log sheets are then 
used to generate the header and archive filename for each 
file. 

4.4.1 Archived File Example 

An example file, 36 JG1P16 . LS3, with a complete header 
block, but abbreviated data block, is presented below: 

/begin_header 
/ aperture. diameter=16 [in] 

/color_temperature=2964 [K] 

/column_headers= [run] , [W_m~{-2}_sr~{-l}_nm~{-l}] 

/ comment =Small_Field_St op 
/conf iguration= 16 .Lamps _0n 
/data_type=Radiance 
/date=1993JUN19 
/experiment=SIRREX_2 
/group. i dent if ier=GSFC 
/group.sub.ident if ier=42_Inch_Sphere 
/ informat ion.fi le=7 14.doc . t_2 
/instrument.! dent if ier=NIST_PR7 14 


/ invest igat or s=Unknown 
/lab_temperature=27 [C] 

/lamp_current=bankl.6 . 502 [A] f bank2.6 . 499 [A] 
/number. data_rows= 173 
/original.f ile_name=36IGSP00 . VR3 
/pr714_f ield_stop-0 . 125.X.0 . 125 [deg] 
/pr714_integration_time=628 [msec] 

/pr714_lens. distanced . 3[m] 

/run_type=Sphere_Radiance_Instrument .Calibration 
/sequence_number=3 
/source. distance=51 . 75 [in] 
/sphere.position=centered.(0 ,0) 

/time=17 : 10 : 03 [PDT] 

/total_radiance=207 . 1 [W_m~{-2}_sr~{-l}] 
/wavelength_range=380>1068 ; 4 [nm] 
/wavelength_resolution=5 [nm] 

! Sphere was turned on at 15:49. 

IPR714 was turned on at 9:30. 

/end.header® 

380 1 . 833e-002 
384 1 .962e-002 
388 2. 131e-002 

1060 3 . 963e-001 
1064 3 . 988e-001 
1068 4 . 021e-001 

4.4.2 Condensed File Name Example 

The sample file name 36JG1P16 . LS3 represents the fol- 
lowing: 

Year 3 = 1993, 

Month 6 = JUN, 

Day J = 19, 

Source Identifier G1 — GSFC 42 in sphere, 

Instrument Identifier P = NIST PR714 radiometer, 
Configuration Code 16 = 16 lamps on, 

Data Type L = Radiance, 

Run Type S = Sphere radiance instrument calibration, 

and 

Sequence Number 3 = Run number three. 

It is worth noting again that a verbose description of 
the file name is included in the keywords: 

/configuration, 

/data.type, 

/date, 

/instrument.identif ier, 

/run. type, 

/sequence .number, and 
/source. ident if ier. 

The configuration code used depends on the source used. 
A configuration code of 16 indicates 16 lamps were used if 
the source was the GSFC 42 in sphere. 


27 



The SeaWiFS Bio-Optical Archive and Storage System (SeaBASS), Part 1 


4.4.3 Expanded File Name Example 

Using the same sample file, the date stamp of 13 June 
1993 would be coded as 93JUN13. The GSFC 42 in sphere 
is represented as G-42in. Instrument and configuration 
codes remain unchanged, (P16 in the example file), as do 
data type, run type, and sequence codes, e.g., LS3. The 
expanded file name is coded as: 

93JUN13.G-42in.P16.LS3. 

The expanded format uses only 22 of the 31 characters 
normally available to many operating systems capable of 
extended file naming. The remaining characters are avail- 
able for use in local applications, perhaps for tracking post- 
processing of the data. 

4.5 KEYWORD GLOSSARY 

The keyword glossary is composed of two sets: required 
and approved keywords. The former must appear in every 
SIRREX data file, while the latter appear only as needed 
to fully describe the contents of the file. Of course, local 
use of a data file permits the addition of locally defined 
keywords, perhaps for tracking post- processing of the data, 

4.5.1 Required Keywords 

The required keywords for the SIRREX data sets are 
as follows: 

/begin_header This keyword does not have a value and 
must be the first entry in the series of lines comprising 
the header block. 

/column_headers={s) This is a comma-delimited list of 
the headers for the columnar data in the file. Usu- 
ally the ith entry in the list is associated with the 
ith column. In cases where a sequence of data is 
taken, the data columns are indicated by a range. 
Bands 1 through 6 would be bandl>band6, and 25 
data points and their average would be represented 
as raw01>raw25 , average. 

/ conf iguration=(s) This string is a verbose description of 
the specific configuration used as part of the file name 
and describes the physical configuration or setup used 
for the measurement in question. Usually orientation, 
distances, and other parameters are specific to a type 
of measurement. 

/data_type=(s) This is a verbose version of the data type 
given in the file name. 

/dat e=yyyymmmdd In this format, yyyy is the year, mmm 
is the abbreviated three-character Gregorian month of 
the year, and dd is the day of the Gregorian month. 
The month abbreviations use the first three characters 
of the month: Jan is January, Feb is February, etc. 


/ end_header<9 This keyword does not have a value and 
must be the last entry in the series of lines comprising 
the header block. Note the at-sign (0) at the end. 

/experiment^) This entry identifies the experiment as- 
sociated with the data collected, e.g., SIRREX_2. 

/group_identif ier={s) This string identifies the group 
conducting the test, e.g., CHORS. 

/ group_sub_identif ier=(s) This string identifies the spe- 
cific lamp, sphere, or plaque used during the test. 

/ instrument_identif ier=(s) This string identifies the in- 
strument used to conduct the test. 

/investigators=(s) The string for this keyword contains 
the names of the individuals (separated by commas) 
who collected the data contained in the file. 

/number_data_rows={n) This is the number of data rows 
(or lines) after the /end_headerQ keyword. If this 
number is different than the actual number of data 
rows in the file, there has been some kind of file cor- 
ruption. 

/original_f ile_name={s) Most data is acquired on com- 
puter systems that have already adopted a particular 
file naming convention. Once the data from such a de- 
vice is transferred to the SIRREX database, it receives 
a new name based on the condensed SIRREX naming 
convention. This entry is reserved for keeping track of 
the original file name. 

/run_type={s) This string is a verbose version of the run 
type recorded in the file name and identifies the type 
of run executed during the measurement process. It 
has the allowed values given in the Run Type section 
discussed earlier. 

/sequence_niunber=(n) This number indicates which ex- 
periments, in a series of experiments, are contained 
within this file. In general, the sequence numbering 
is also used to distinguish archive file numbers that 
would otherwise be identical. 

There is no specific requirement regarding the ordering 
of the keywords in the files, other than the /begin.header 
and / end_header@ keywords, which must be first and last 
respectively. It is recommended, however, that the sec- 
ond through next to last keywords be kept alphabetical to 
maximize the readability of the file. 

4.5.2 Approved Keywords 

The approved keywords for the SIRREX data sets are 
as follows: 

/ aperture_diameter=(n) [u] This is the diameter of the 
source aperture. 

/BSI_channels={n) These are the channels used on the 
BSI radiometer, nominally 1>6. 
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/BSI_units={s) These are the units of the BSI radiometric 
measurement, nominally volts. 

/calibration^ ile==(s) This is the file containing the cal- 
ibration information for the radiometer in question. 
The file contains the needed information to convert 
from raw data to radiance. 

/color_temperature={n) [u] This field indicates the equiv- 
alent blackbody temperature of the source in degrees 
Kelvin [K] . 

/comment={s) This field is intended for comment entries 
included on most of the data logs. 

/entrance_slit=(n)[u] This keyword is the monochrom- 
eter entrance slit width. 

/exit_slit=(n)[i£] This is the width of the monochrom- 
eter exit slit. 

/gain=(n) This is the viewing instrument gain and should 
be recorded as a complete factor rather than just an 
exponent, for example, 10 "{2} or 100 rather than 2. 

/inf ormation_f ile={s) This is a text (ASCII) file con- 
taining instrument and file format information. 

/internal_averages_per_reading=(n) This is the num- 
ber of internal averages per reading for the BSI ra- 
diometer, nominally 100. 

/lab_temperature=(n)[u] The temperature of the labora- 
tory where the data was collected in units of Fahren- 
heit [F] or Celsius [C] . 

/lamp_current=(n)[tx] The current applied to the lamp(s) 
used. The units are usually amps [A] . A comma- 
delimited list can be used for multiple lamp scenarios, 
in which case, values for particular banks are indicated 
by preceding the value with bankl_, bank2_, etc., indi- 
cators. For example, /lamp_current=bankl_10 [A] , 
bank2_20 [A] . 

/lamp_voltage=(n)[u] The voltage applied to the lamp(s) 
used. The units are usually volts [V] . A comma- 
delimited list can be used for multiple lamp scenarios, 
in which case, values for particular banks are indicated 
by preceding the value with bankl_, bank2_, etc., indi- 
cators. For example, /lamp_voltage=bankl_10 [V] , 
bank2_20 [V] . 

/number.of _readings_per_external_average=(n) This 
is the number of BSI readings, where each reading is 
a number of internal averages, which are averaged ex- 
ternally, typically 30. 

/plaque_distance=(n)[u] This is the distance between the 
lamp and the plaque. 

/plaque_viewing_angle(n)[u] This keyword is the angle 
of the viewing radiometer with respect to the plaque, 
nominally 45 [deg] . 


/pr714_f ield_stop=(s) These are the field stops used with 
the PR714 radiometer, either 0 . 125_x_0 . 125 [deg] or 
0 • 5_x_l . 5 [deg] . 

/pr714_integration_time={n)[u] This is the integration 
time of the PR714. 

/pr714_lens_distance=(n)[n] The focal distance of the 
lens mounted on the PR714, as determined by the scale 
imprinted on the lens. 

/radius_one={n)[u] This keyword gives the aperture ra- 
dius of the source. 

/radius_tvo=(n)[ti] This is the aperture radius of the col- 
lector. 

/ref erence_lamp_id={s) This keyword is the conventional 
name for the reference lamp used, like, F269, 91453, 
etc., used to illuminate the sphere or plaque. 

/relative_humidity=(n) [u] This is the relative humidity 
of the laboratory air during the test. The units are 
usually [7,] . 

/serial_munber=(s) This is the serial number of the re- 
cording instrument. It is shown here as a general char- 
acter string, since many serial numbers include let- 
ters. 

/source_distance=(n) [tx] This is the distance between the 
viewing radiometer and the source. 

/sphere_diameter=(n) [u] This keyword is the diameter of 
the source sphere. 

/sphere_position=(s) This is the viewung position of the 
radiometer with respect to the source aperture, nomi- 
nally centered. (0,0). 

/sxr_channels=(n) This entry gives the channels used on 
the SXR, nominally 1>6. 

/sxr_lens={s)[u] This entry gives the f - stop and distance 
used with the SXR radiometer; nominally, this key- 
word has a value of f /1 . 4_1 . 2 [m] . 

/sxr_temperature=(n)[u] This is the temperature setting 
of the thermoelectric cooler for the SXR. 

/time=(s)[u] The format for (s) is hh:mm:ss.ss< w'here hh 
is the 24-hour hour of the day, mm is the minute of 
the hour, and ss.ss is the decimal second of the hour. 
The ss.ss field is optional and the [u] field is used to 
denote GMT, EST, EDT. etc., nominally [GMT], 

/total_radiance=(n)[u] This is the total radiance col- 
lected in units of W_nT{-2}_sr~{-l}. 

/ua_aperture=(n) [uj The aperture setting on the UA ra- 
diometer. 

/ua_channels=(.s) The entry gives channels used with the 
UA radiometer. 
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/ua_temperature=(n)[u] The temperature of the UA ra- 
diometer. 

/wavelength_range=(n)[u] This is the wavelength range 
for the data expressed as n\ > 712 ;n3, where n\ is the 
lowest value in the range, ni is the largest value in the 
range, and ri3 is the increment value. The units are 
usually [nm] . 

/wavelength_resolution=(n)[u] This is the spectral reso- 
lution of the instrument performing the measurement. 
Units are typically [nm] . 

4.6 DISCUSSION 

The SIRREX archive contains files stored in an ASCII 


text format that is independent of the platform in use. 

Each file has a header block containing information re- 
garding when, where, how, and by whom the data was 
taken. This header also contains information specifying 
how many rows and columns of data to expect. Head- 
ers from original data files are placed into SIRREX key- 
words, when appropriate, as well as information from the 
log sheet. A sequence column is added if necessary. Each 
column of data is written in a single numeric format before 
the file is placed in the archive. The result is a file that 
is usable on many platforms and has descriptive header 
information in a known format that is easily parsed for 
automated processing. This header contains the details of 
how the data was obtained and how much and what type 
of data is contained in the file. 
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NASA Goddard Space Flight Center , Greenbelt, Maryland 
Abstract 

This document describes the naming convention and procedures used in archiving the SeaWiFS prelaunch 
calibration data obtained from SBRC. The archive contains data from BTR, ASR, LSF, and temperature 
dependence measurements, as well as images of the sun, moon, sky, and an integrating sphere. This data set 
contains characterization information on the SeaWiFS instrument provided by the manufacturer. As such, it 
serves as a baseline to the operation of the instrument on orbit. 


5.1 INTRODUCTION 

The SBRC database is a subset of the SIRREX data- 
base and closely follows that architecture as described in 
Chapter 4. As part of SeaBASS, the SBRC archive has 
both a condensed and an expanded file naming conven- 
tion. The former uses the so-called eight-point-three 11- 
character format used by MS-DOS computers; the latter is 
intended for use on computers that support a 31-character 
file name. All files are archived using the condensed file 
name, but users can request files in the expanded naming 
convention. 

5.2 CONDENSED CONVENTION 

Every condensed file name starts with a three-character 
time stamp that identifies when the data was taken, fol- 
lowed by a five- character primary test code identifying 
what was measured and with what instrumentation, and 
then a three-character extension identifying the type of 
data recorded. The condensed file naming convention is as 
follows: 

DDDPPPPP . eee 

where DDD is the date stamp, PPPPP is the primary test 
code, and eee is the file extension. The date stamp is 
composed of three characters: YMD where, Y is the year 
code. M is the month code, and D is the day of the month 
code. The primary test code is composed of two sub-codes: 
SSRRR where. SS is a two-character source identifier, and 
RRR is a three-character run type. The file extension is 


divided into three codes: dbs where, d is a one-character 
data type, b is a one-character band indentifier, and s is 
a one-character sequential run character (0, . . . , 9). Thus, 
the complete condensed file name convention is: 

DDDSSRRR . dbs 

where, once again, the condensed convention makes full 
use of the eight- point- three 11-character format designed 
for MS-DOS PCs. 

5.2.1 Condensed Date Code 

The condensed time code is a three- character code, YMD, 
and is decoded as follows: Y has the allowed values 1, . . . , Z, 
where 1 = 1991, 2 = 1992, 3=1993, A = 2000, and Z=2025; 
M has the allowed values 1, 2, 3, . . . , A, B, C, where l=Jan., 
2=Feb., 3=Mar., A=Oct., B=Nov., and C=Dec.; and D has 

the allowed values 1. 2. 3, . . . , A. B V. where 1 = 1, 2=2, 

3=3, A=10, B = 1 1, and V=31. If any part of the date field 
is not known, the underscore (_) is used for the unknown 
component. 

5.2.2 Source Identifier Code 

The source identifier code is a two-character code which 
identifies the data source as SBRC and the SeaWiFS in- 
strument. It is always equal to HW. 

5.2.3 Condensed Run Type 

The run type has different allowed values depending on 
the data type (d), although all data types allow DOC to indi- 
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cate their information file. All DOC files use the associated 
data type as the value for this keyword. For example, the 
information file for image data contains /run_type= Image. 
For a data type of A, all files are assigned the run type 

DAT. 

For a data type of B, the first two digits indicate the 
width of the bright target in pixels, and the third character 
represents the illumination level. The allowed run types for 
B data are: 

01M for a 1 pixel wide target, mid-level illumination, 

01H for a 1 pixel wide target, high-level illumination, 

03M for a 3 pixel wide target, mid-level illumination, 

03H for a 3 pixel wide target, high-level illumination, 

06H for a 6 pixel wide target, high-level illumination, 

10M for a 10 pixel wide target, mid-level illumination, 
and 

10H for a 10 pixel wide target, high-level illumina- 
tion, 

For a data type of D, all files are assigned the run type 
LIN. For a data type of I, the three-character code de- 
scribes what the sensor was viewing and has the following 
allowed values: 

INT for an image of the integrating sphere, 

LUN for a lunar image, 

SKY for an image of the sky, and 
SOL for a solar image. 

For a data type of L, the three-character code describes 
in which direction the line spread function was measured 
and has the following allowed values: 

ASC for an along-scan measurement, and 
ATR for an along-track measurement, 

For a data type of T, the file is assigned the run type 
FCT. 

5.2.4 Condensed Data Type 

This one-character code establishes the type of data 
collected and has the following allowed values: 

A for ASR, 

B for BTR, 

D for linearity data, 

I for images, 

L for LSF, and 

T for temperature dependence and telemetry coef- 
ficients. 

5.2.5 Band Code 

The band code is comprised of a single character and 
has the allowed values of 1. . . . , 8, A and N. A number indi- 
cates which one of the possible eight bands is in the data 
file; whereas. A indicates all eight bands are included and N 
indicates no bands are included. The N designation is used 
only on documentation files; all other files must contain 
data from at least one hand. 


5.2.6 Sequence Number 

This digit represents which particular measurement in 
a series of similar measurements is present in the data file. 
The allowed values are 0, . . . , 9. 

5.3 EXPANDED CONVENTION 

The expanded file name uses the 0, . . . , Z character set 
of the condensed file names. The underscore (_) is used 
for undetermined or unknown elements and the period ( . ) 
is used to separate the expanded name into smaller ele- 
ments. Every expanded file name uses 22 characters start- 
ing with 1) a seven-character time stamp that identifies 
when the data was taken, followed by 2) a six-character 
primary test code identifying what was measured, then 

3) a three-character code giving the instrumentation used 
and its configuration during data collection, and finally 

4) a three-character sequence identifying the type of data. 
The expanded file naming convention is as follows: 

YYMMMDD . EEEEEE . RRR . dbs, 

where, YYMMMDD is a Gregorian time stamp (YY is the deca- 
dal year with 50 representing 1950 and 49 representing 
2049, MMM is an abbreviated month name which uses the 
first three characters of the month, and DD is the day of 
the month); EEEEEE is an expanded source identifier which 
is always set to HSWiFS for the SeaWiFS instrument; RRR 
is the Run Type; d is the Data Type; b is the Band Code; 
and s is the Sequence Character. Purposely, there is no 
definitive file extension on the expanded file name, so the 
user can append one based on local requirements. 

5.4 ARCHIVING PROCEDURES 

This protocol specifies a header format which is added 
at the beginning of every archived file, as for all SeaBASS 
files. All relevant information in an existing header is in- 
corporated into the SBRC keyword vocabulary. The re- 
quirement for tab-delimited data columns and a sequence 
column often require modifying the original data. In cases 
where there was a pixel number or wavelength column, this 
was used in place of a sequence number. Although neither 
of these columns starts with 1, both are integer values that 
are incremented by one. 

While the data is being modified, a consistent numeric 
format is used for each column of data. This format is cho- 
sen based on the range and precision of the data. Using a 
consistent numeric format allows a quick visual inspection 
of the data to locate errors. 

5.4.1 Archived File Example 

This example is identical in form to an actual file in 
the SBRC archive. Some of the keyword values have been 
edited to fit in the space provided. Also, not all 1288 lines 
of data are presented. 
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Sample Data File: 383HWSKY.IA5 
/begin.header 
/band=All 

/column_headers=Pixel_# ,Band_l>Band_8 
/comment=looking_due_east ,34. 5_deg_elevation 
/ data_type=Image 
/date=1993MAR07 

/gain=Bl_2 ,B2_2 ,B3.2 ,B4_2 , B5_2 ,B6_2 , B7_2 ,B8_2 

/inf ormation_f ile=45AHWD0C . INO 

/instrumental dent if ier=SeaWiFS 

/invest igators=Alan_Holmes 

/mirror_side=Side_A_and_Side_B 

/modif ication_status=Bef ore_Stray_Light_Mods 

/number_data_rows= 1288 

/original_f ile_name=skyll45 

/ run_t ype=Sky 

/ sequence_number=6 

/time=ll :45 [PST] 

/ end_header<5 


-001 

1023 

0000 

1023 

0000 

1023 

0000 

1023 

1023 

0000 

0022 

0018 

0019 

0021 

0017 

0016 

0020 

0020 

0001 

1023 

1023 

1023 

1023 

1023 

1023 

1023 

1023 

0002 

1023 

1023 

1023 

1023 

1023 

1023 

1023 

1023 

0003 

1023 

1023 

1023 

1023 

1023 

1023 

1023 

1023 

0004 

1023 

1023 

1023 

1023 

1023 

1023 

1023 

1023 

0005 

1023 

1023 

1023 

1023 

1023 

1023 

1023 

1023 


1280 

1023 

1023 

1023 

1023 

1023 

1023 

1023 

1023 

1281 

1023 

1023 

1023 

1023 

1023 

1023 

1023 

1023 

1282 

1023 

1023 

1023 

1023 

1023 

1023 

1023 

1023 

1283 

1023 

1023 

1023 

1023 

1023 

1023 

1023 

1023 

1284 

1023 

1023 

1023 

1023 

1023 

1023 

1023 

1023 

1285 

1023 

1023 

1023 

1023 

1023 

1023 

1023 

1023 

1286 

1023 

1023 

0000 

0000 

1023 

1023 

0000 

1023 


5,4*2 Condensed File Name Example 

The sample file name, 383HWSKY.IA5, represents the 
following: 

Year 3 = 1993 
Month 8 = August 
Day 3 = 3 

Source Identifier HW = SBRC and the SeaWiFS 
instrument 

Run Type SKY = Sky 
Data Type I = Image 
Band Number A = All Bands 
Sequence Number 5 = 5th run 

Note that a verbose description of the file name is in- 
cluded in the keywords: /date, /instrument, /run.type, 
/datatype, /band, and /sequence_number. 


5,4.3 Expanded File Name Example 

Using the same file name, 383HWSKY . IAS, the expanded 
codes are: 

Year 1993 = 93 
Month August = AUG 
Day 3 = 03 

Source Identifier HW = SBRC and the SeaWiFS 
instrument 

Run Type Sky = SKY 
Data Type Image = I 
Band Number All Bands = A 
Sequence Number 5 = 5 
So the expanded file name is: 

93AUG03 . HSWiFS . SKY . IA5 

This format uses only 22 of the 31 characters available, 
leaving the remaining characters available for use in post- 
processing applications. 

5.5 KEYWORD GLOSSARY 

The keyword glossary consists of required and approved 
keywords. Required keywords will be found in all archived 
files, while approved keywords appear whenever the rele- 
vant value is known. 

5.5.1 Required Keywords 

The required keywords for the SBRC database follow 
in alphabetical order: 

/band={s) This string indicates the band measured and 
has the allowed values of 1>8, A and N. 

/begin_header This keyword does not have a value and 
must be the first entry in the series of lines comprising 
the header block. 

/column_headers=(s) This keyword is a descriptive list 
of headers, which is comma-delimited, for the colum- 
nar data in the file. Usually the ith entry in the list 
is associated with the ith column. In cases where a 
sequence of data is taken, the data columns are in- 
dicated by a range. For example, bands 1 through 
6 would be recorded as band_l>band_6 and the scan 
number followed by pixels 865 through 1285 would be 
represented as Scan_Number , pixel_865>pixel_1285. 
Each column header is a description of the data, and 
no abbreviations are used. 

/data_type=(s) This is a verbose description of the data 
type recorded in the file name. See Section 5.2.4 for 
allowed values. 

/dat e=yyyymmmdd In this format, yyyy is the year, rnmm 
is the abbreviated three-character Gregorian month of 
the year, and dd is the day of the Gregorian month. 
The month abbreviations use the first three characters 
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of the month: JAN is January, FEB is February, etc. In 
cases where the day is unknown, it is not used (for 
example, 1994DEC). 

/end_header<B This keyword does not have a value and 
must be the last entry in the series of lines comprising 
the header block. Note the at-sign (@) at the end. 

/gain=(s) This indicates the gain setting for each band. 
Each gain value is preceded by a band designation. 
For example, band 1 at gain 1 and band 3 at gain 2 
would be designated Bl_l ,B3_2. 

/instrument_identif ier={s) This string indentifies the 
instrument used during the test. For SBRC data, the 
instrument is always SeaWiFS. 

/investigators 3 (s) The string for this keyword contains 
the names of the individuals (separated by commas) 
who collected the data contained in the file. 

/modif ication_status=(s) The string for this keyword 
indicates whether the data was taken before or after 
the stray light modifications were made to the Sea- 
WiFS instrument. 

/number _data_rows=(n) This is the number of data rows 
(or lines) after the /end_header<2 keyword. If this 
number is different than the actual number of data 
rows in the file, there has been some kind of file cor- 
ruption. 

/original.f ile_name=(s) This string provides a record 
of the file name prior to archiving. The original files 
are held in a separate archive complete with the orig- 
inal format and headers. The original files provide a 
baseline reference to the archived data. 

/run_type=(s) This is a verbose description of the run 
type recorded in the file name. Refer to Section 5.2.3 
for allowed values. 

/sequence_number={n) This is identical to the sequence 
number recorded in the file name. 

5.5.2 Approved Keywords 

The approved keyword list for the SBRC data set is: 

/collect ion_aperture=(n)[u] This describes the aperture 
setting used during an ASR measurement. 

/comment={$) This string is a record of any comment en- 
tered in the original header during testing. 

/counts_per_amp=(n) [u] This specifies the value used to 
convert digital counts into amperes during an ASR 
measurement. 


/factor=(n) This is a unitless number used to force this 
computed data to agree with data obtained from a 
sphere measurement. A number greater than one im- 
plies more amps than expected. 

/f ilter=(n)[u] This specifies the type and thickness of the 
Schott color glass filter used. 

/ifov={n)[n] This specifies the instantaneous field-of-view 
of the SeaWiFS instrument during an ASR measure- 
ment. 

/inf ormation_f ile 3 (s) This string provides the file name 
of an ASCII text file containing additional information 
regarding the measurement setup used and the format 
of the data obtained. 

/lamp=(s) This string indicates the number and wattage 
of lamps used during a BTR test. 

/mirror_side=(s) This string describes which side of the 
half-angle mirror was used during the test. If both 
sides of the mirror were used, it will be indicated here 
and there will be a column to indicate which side was 
measured for each data point. 

/slit=(n)[u] This specifies the horizontal and vertical di- 
mensions of the target slit in a BTR measurement. 

/tim e=hh:mm:ss[u] This keyword records the time in the 
hours:minutes:seconds format. The time zone is indi- 
cated in the unit field as a three character abbrevia- 
tion, EST for Eastern Standard Time, GMT for Green- 
wich Mean Time, etc. 

/wavelength_range=(n)[u] This string indicates the spec- 
tral range, increment, and units used during an ASR 
measurement. For example, a measurement starting at 
400mn. and ending at 850 nm using a 4nm increment 
would be represented as: 400>850 ; 4 [nm] . 

5.6 DISCUSSION 

The SBRC archive contains files compliant with the 
SeaBASS protocol. Each file's header contains the original 
header information as well as information specific to the 
data type and archive file name. The original image files 
were split into separate bands to reduce the size of the 
files and eliminate embedded headers between bands. Line 
numbers were added to any file that did not already have 
them as a means of determining gaps in the data. Files 
that contained space-delimited data were converted to tab- 
delimited. The /column.headers keyword provides a list 
of what data appears in each column. The result is a self- 
contained file header that is similar to that of the SIRREX 
database. 
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Glossary 

ASCII American Standard Code for Information Inter- 
change 

ASR Absolute Spectral Response 

BATS Bermuda Atlantic Time Series 
BBOP Bermuda Bio-Optical Profiler 
BOFS British Ocean Flux Study 

BSI Biospherical Instruments, Incorporated 
BTR Bright Target Recovery 

CHORS Center for Hydro-Optics and Remote Sensing (San 
Diego State University) 

COTS Commercial Off-The-Shelf 
CVT Calibration and Validation Team 
CZCS Coastal Zone Color Scanner 

DAAC Distributed Active Archive Center 
DAT Digital Audio Tape 
DEC Digital Equipment Corporation 
DXW Not an acronym, but a lamp designator. 

E-mail Electronic Mail 
EDT Eastern Daylight Time 
EOS Earth Observing Satellite 
EqPac Equatorial Pacific 
EST Eastern Standard Time 

FEL Not an acronym, but a lamp designator. 

FTP File Transfer Protocol 

GLOBEC Global Ocean Ecosystems dynamics 
GMT Greenwich Mean Time 
GSFC Goddard Space Flight Center 
GUI Graphical User Interface 

IBM International Business Machines 
IDL Interactive Data Language 
IP Internet Protocol 

ISIC Integrating Sphere Irradiance Collector 

JARE Japanese Antarctic Research Expedition 
JGOFS Joint Global Ocean Flux Study 

LDEO Lamont-Doherty Earth Observatory (Columbia Uni- 
versity) 

LOG Local Time 
LSF Line Spread Function 

MIT Massachusetts Institute of Technology 
MLML Moss Landing Marine Laboratory 
MO Magneto-Optical 
MOBY Marine Optical Buoy 

MOCE Marine Optical Characterization Experiment 
MS-DOS Microsoft/Disk Operating System 

NASA National Aeronautics and Space Administration 
NET NIMBUS Experiment Team 
NIMBUS Not an acronym, a series of NASA experimental 
satellites containing a wide variety of atmosphere, 
ice, and ocean sensors. 

NIST National Institute of Standards and Technology 
NOAA National Oceanic and Atmospheric Administration 
NRaD Naval Research and Development 
NRIFSF National Research Institute of Far Seas Fisheries 
(Japan) 

OSC Orbital Sciences Corporation 


PAR Photosynthetically Available Radiation 
PC (IBM) Personal Computer 
PR Photo Research 

QC Quality Control 

RDBMS Relational Database Management System 

SBRC (Hughes) Santa Barbara Research Center 
SeaBASS SeaWiFS Bio-Optical Archive and Storage System 
SEAPAK Not an acronym, an image display and analysis 
package developed at GSFC. 

SeaWiFS Sea-viewing Wide Field-of-view Sensor 
SGI Silicon Graphics, Incorporated 
SI International System of Units (SI), or Systime In- 
ternational d ’ Unites 
SIO Scripps Institution of Oceanography 
SIRREX SeaWiFS Intercalibration Round-Robin Experiment 
S/N Serial Number 

SRT Sigma Research Technology, Incorporated 
STM Science Team Member 
Sun Sun Microsystems 
SXR SeaWiFS Transfer Radiometer 

UA University of Arizona 
UCSB University of California at Santa Barbara 
UM University of Miami 

UNIX Not an acronym, a computer operating system. 

WFF Wallops Flight Facility (NASA) 

WHO I Woods Hole Oceanographic Institute 
WOCE World Ocean Circulation Experiment 
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